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PROTEINS, GENES AND THEIR USE FOR 
DIAGNOSIS AND TREATMENT OF BREAST CANCER 


INTRODUCTION 

The present invention relates to the identification of proteins and protein isoforms 
5 that are associated with predisposition to Breast Cancer and its onset and development, and 
of genes encoding the same, and to their use for clinical screening, diagnosis, prognosis, 
therapy and prophylaxis, as well as for drug screening and drag development. 

RACK GROUND OF THE INVENTION 

10 Breast cancer is the most frequently diagnosed non-skin cancer among women in the 

United States. It is second only to lung cancer in cancer-related deaths. Approximately 
180,000 new cases of breast cancer will be diagnosed in 1997, and about 44,000 women are 
expected to die from the disease (National Cancer Institute, www.nci . g rg, USA, 1999). In 
the UK, breast cancer is by far the commonest cancer for women, with 34,600 new cases in 
15 1998 (Cancer Research Campaign, www.crc.org . UK, 2000). Ninety-nine percent of breast 

cancers occur in women. The risk of developing breast cancer steadily increases with age; 
the lifetime risk of developing breast cancer is estimated to be 1 in 8 for women in the US. 
The annual cost of breast cancer treatment in the United States is approximately $10 billion 
(Fuqua, et. al. 2000, American Association for Cancer Research, www.aacr.or g, USA). 

20 breast cancer incidence has been rising over the past five decades, but recently it has 

plateaued. This may reflect a period of earlier' detection of breast cancers by mammography. 
A number of established factors can increase a woman's risk of having the disease. These 
include older age, history of prior breast cancer, significant radiation exposure, strong family 
history of breast cancer, upper socioeconomic class, nulliparity, early menarche, late 
25 menopause, or age at first pregnancy greater than 30 years. Prolonged use of oral 

contraceptives earlier in life appears to increase risk slightly. Prolonged postmenopausal 
estrogen replacement increases the nsk 20 to 40%. It has been speculated that a decrease in 
the age at menarche, changing birth patterns, or a rise in the use of exogenous estrogens has 
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contributed to the increase in breast cancer incidence (Fuqua, et. al. 2000, American 
Association for Cancer Research, www.aacr.org . USA). 

Causes of Breast Cancer 

Breast cancer is a heterogeneous disease. Although female hormones play a 
significant role in driving the origin and evolution of many breast tumours, there are a 
number of other recognised and unknown factors involved. Perturbations in oncogenes 
identified include amplification of the HER-2 and the epidermal growth factor receptor 
genes, and overexpression of cyclin Dl. Overexpression of these oncogenes has been 
associated with a significantly poorer prognosis. Similarly, genetic alterations or the loss of 
tumour suppressor genes, such as the P 53 gene, have been well documented in breast cancer 
and are also associated with a poorer prognosis. Researchers have identified two genes, 
called BRCA1 and BRCA2, which are predictive of premenopausal familial breast cancer. 
Genetic risk assessment is now possible, which may enhance the identification of candidates 
for chemoprevention trials (Fuqua, et. al. 2000, American Association for Cancer Research, 
www.aacr.orp . USA). 

Diagnosis 

Early diagnosis of breast cancer is vital to secure the most favourable outcome for 
treatment. Many countries with advanced healthcare systems have instituted screening 
programmes for breast cancer. This typically takes the form of regular x-ray of the breast 
(mammography) during the 50-60 year old age interval where greatest benefit for this 
intervention has been shown. Some authorities have advocated the extension of such 
programmes beyond 60 and to the 40-49 age group. Health authorities in many countries 
have also promoted the importance of regular breast self-examination by women. 
Abnormalities detected during these screeening procedures and cases presenting as 
symptomatic would normally be confirmed by aspiration cytology, core needle biopsy with 
a stereotactic or ultrasound technique for nonpalpable lesions, or incisional or excisional 
biopsy. At the same time other information relevant to treatment options and prognosis, such 
as oestrogen (ER) and progesterone receptor (PR) status would be determined (National 
Cancer Institute, USA, 2000, Breast Cancer PDQ, www.nci.orgt . 
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Disease Staging and Prognosis 

Staging is the process of finding out how far the cancer has spread. The staging 
system of the American Joint Committee on Cancer (AJCC), also known as the TNM 
system, is the one used most often for breast cancer. The TNM system for staging gives 
5 three key pieces of information: 

The letter T followed by a number from 0 to 4 describes the tumours size and spread 
to the skin or chest wall under the breast. A higher number means a larger tumour and/or 
more spread to tissues near the breast. 

The letter N, followed by a number from 0 to 3, indicates whether the cancer has 
10 spread to lymph nodes near the breast and, if so, whether the affected nodes are adhered to 
other structures under the arm. 

The letter M, followed by a 0 or 1 , shows whether the cancer has metastasized to 
other organs of the body or to lymph nodes that are not next to the breast. 

To make this information somewhat clearer, the TNM descriptions can be grouped 
15 together into a simpler set of stages, labeled stage 0 through stage IV (0-4). In general, the 
lower the number, the less the cancer has spread. A higher number, such as stage IV (4), 
means a more serious cancer. (American Cancer Society, 2000, USA, www.cancer.org ) 


Breast Cancer Survival by Stage 


Stage 

5-year relative survival rate 

0 

100% 

1 

98% 

IIA 

88% 

IB 

76% 

IMA 

56% 

IIIB 

49% 

IV 

16% 


(American Cancer Society, 2000, USA, www.cancer.org) 

Although anatomic stage (size of primary tumour, axillary lymph node involvement) 
is an important prognostic factor, other characteristics may have predictive value. For 
example studies from the National Surgical Adjuvant Breast and Bowel Project (NSABP) 
and the International Breast Cancer Study Group (IBCSG) have shown that tumour nuclear 
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grade and histologic grade, respectively, are important indicators of outcome following 
adjuvant therapy for breast cancer. There is substantial evidence that oestrogen receptor 
status and measures of proliferative capacity of the primary tumour (thymidine labelling 
index or flow cytometric measurements of S-phase and ploidy) may have important 
independent predictive value. In stage II disease, the PR status may have greater prognostic 
value than the ER status. Tumour vascularisation, c-erbB-2, c-myc, p53 expression, and 
lymphatic vessel invasion may also be prognostic indicators in patients with breast cancer 

(National Cancer Institute, USA, 2000, Breast Cancer PDQ, www.nci.orP and references 
therein). 


Thg Need for Improved Diagnos tic Tools in Breast Cancer Detection and Therapy 
Although there are signs that benefits are accruing from the more vigorous 
application of existing screening methods such as targeted mammography and self- 
examination combined with public awareness programs, these approaches have limitations 
m the drive to detect breast cancer as early as possible. An important factor limiting the 
spread of mammographic screening and its extension to wider age groups is cost. 
Mammography requires expensive x-ray equipment and highly trained specialists to operate 
it and interpret mammograms. In addition, suspicious lesions detected by mammography 
currently need to be confirmed or cleared as benign by biopsy. This is an invasive procedure 
that requires subsequent expert histological examination and interpretation, and can delay 
definitive diagnosis. Once breast cancer has been diagnosed, the success of therapeutic 
interventions such as surgery, radiation and chemotherapy in stabilising or eliminating the 
disease can be difficult to establish. It can be particularly difficult to determine the extent of 
any residual disease in patients during remission and to make the important early discovery 
of any relaspe into active disease. Both screening for and confirming the presence of breast 
cancer, and monitoring response to therapy, would be greatly aided by the application of a 
reliable and sensitive test that could detect the disease in serum samples. 

Serum Protein Changes in the Detection of Dkmcp 

There are two types of changes in serum protein patterns that can potentially aid 
diagnosis and disease monitoring. The first of these is the detection in serum of novel 



WO 01/13117 


PCT/GBOO/03143 


proteins, not normally present, that have been shed into the serum from the cancer cells. The 
second type of change that can be of diagnostic significance is the detection of specific 
reactive proteins in the serum produced by the body in response to the disease. An example 
of a protein that can be shed into the serum by some breast cancer cells is a fragment of the 
growth factor receptor known as c-erbB2/HER2/neu, which is present in small amounts on 
the surface of normal breast cells and at much higher levels in some breast cancers (Payne et 
al., 2000, Clin. Chem. 46:175-182). A second example of a protein shed into serum by a 
cancer that has diagnostic or prognostic significance is prostate serum antigen or PSA, 
which is used in the diagnosis and monitoring of prostate cancer (Fowler et al., 2000, J. 

Urol. 163:813-818). A further example of a protein shed into serum by several types of 
cancer that can be of diagnostic or prognostic significance is carcino-embryonic antigen or 
CEA (Lumachi et al., 1999, Anticancer Res, 5C: 4485-4489). The current value of these 
markers for diagnosis is limited by their lack of specificity and sensitivity, and these is a 
need to discover new markers that can better satisfy these criteria. 

A number of reactive proteins collectively termed acute phase proteins, show a 
dramatic increase or decrease in concentration in serum in response to early alarm 
inflammatory mediators such as IL-1 released in response to tissue injury including cancer, 
or infection. An example of a reactive protein present in serum in response to disease that 
has diagnostic or prognostic significance is serum amyloid A or SAA in rheumatoid arthritis 
(Cunnane et al, 2000, J. Rheumatol. 27:56-63). Sensitive detection of selected examples of 
such proteins could also assist in the diagnosis of breast cancer. Due to the high rates at 
which other disorders co-occur with breast cancer, the time-consuming nature of existing, 
largely inadequate tests and their expense, it would ne highly desirable to measure a 
substance or substances in samples of serum, blood or urine that would lead to a positive 
diagnosis of breast cancer or that would help to exclude breast cancer from the differential 

diagnosis. v 

Therefore a need exists to identify breast cancer associated proteins as sensitive and 
specific biomarkers for the diagnosis, to assess severity, to predict the outcome of breast 
cancer in living subject, and to monitor the treatment of breast cancer. Additionally, these is 
a clear need for new therapeutic agents for breast cancer that work quickly, potently, 
specifically, and with fewer side effects. 
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SUMMARY OF THE INVFNTtnN 

The present invention provides methods and compositions for clinical screening, 

5 diagnosis, prognosis, therapy and prophylaxis of breast cancer, for monitoring the 
effectiveness of breast cancer treatment, for selecting participants in clinical trials, for 
selecting patients most likely to respond to a particaulr therapeutic treatment, and for 
screening and development of drugs for treatment of breast cancer. 

A first aspect of the invention provides methods for diagnosis of breast cancer that 
10 comprise analyzing a sample of serum by two-dimensional electrophoresis to detect the 

presence or level of at least one Breast Cancer-Associated Feature (BF), eg, one or more of 
the BFs disclosed herein, or any combination thereof. These methods are also suitable for 
clinical screening, prognosis, monitoring the results of therapy, identifying patients most 
likely to respond to a specific therapeutic treatment, drug screening and development, and 
1 5 identification of new targets for drug treatment. 

A second aspect of the invention provides methods for diagnosis of breast cancer that 
comprise detecting in a sample of serum the presence or level of at least one Breast Cancer- 
Associated Protein Isoform (BPI), e.g., one or more of the BPIs disclosed herein or any 
combination thereof. These methods are also suitable for clinical screening, prognosis, 

20 monitoring the results of therapy, identifying patients most likely to respond to specific 

therapeutic treatments, drug screening and development, and identification of new targets for 
drug treatment. 

A third aspect of the invention provides monoclonal and polyclonal antibodies 
capable of immunospecific binding to a BPI, e.g., a BPI disclosed herein. 

2 5 A fourth aspect of the invention provides a preparation comprising an isolated BPI, 

«.<?., a BPI free from proteins or protein isoforms having a significantly different isoelectric 
point or a significantly different apparent molecular weight from the BPI. 

A fifth aspect of the invention provides methods of treating breast cancer, comprising 
administering to a subject a therapeutically effective amount of an agent that modulates (/.e., 

30 upregulates or downregulates) the expression or activity (e.g. enzymatic or binding activity), 
or both, of a BPI in subjects having breast cancer, in order to prevent or delay the onset or 
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development of breast cancer, to prevent or delay the progression of breast cancer, or to 

ameliorate the symptoms of breast cancer. 

A sixth aspect of the invention provides methods of screening for agents that 
modulate (i.e., upregulate or downregulate) the expression or the enzymatic or binding 
activity of a BPI, a BPI analog, or a BPI-related polypeptide. 

3.1. DEFINITIONS 

The term “BPI analog” as used herein refers to a polypeptide that possesses a similar 
or identical function as a BPI but need not necessarily comprise an amino acid sequence that 
is similar or identical to the amino acid sequence of the BPI, or possess a structure that is 
similar or identical to that of the BPI. As used herein, an amino acid sequence of a 
polypeptide is "similar" to that of a BPI if it satisfies at least one of the following criteria: 

(a) the polypeptide has an amino acid sequence that is at least 30% (more preferably, at least 
35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at 
least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% or at least 
99%) identical to the amino acid sequence of the BPI; (b) the polypeptide is encoded by a 
nucleotide sequence that hybridizes under stringent conditions to a nucleotide sequence 
encoding at least 5 amino acid residues (more preferably, at least 10 amino acid residues, at 
least 15 amino acid residues, at least 20 amino acid residues, at least 25 amino acid residues, 
at least 40 amino acid residues, at least 50 amino acid residues, at least 60 ammo residues, at 
least 70 amino acid residues, at least 80 amino acid residues, at least 90 amino acid residues, 
at least 100 amino acid residues, at least 125 amino acid residues, or at least 150 amino acid 
residues) of the BPI; or (c) the polypeptide is encoded by a nucleotide sequence that is at 
least 30% (more preferably, at least 35%, at least 40%, at least 45%, at least 50%, at least 
55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at 
least 90%, at least 95% or at least 99%) identical to the nucleotide sequence encoding the 

BPI. As used herein, a polypeptide with "similar structure to that of a BPI refers to a 

polypeptide that has a similar secondary, tertiary or quartemary structure as that of the BPI. 
The structure of a polypeptide can determined by methods known to those skilled in the art, 
including but not limited to. X-ray crystallography, nuclear magnetic resonance, and 
crystallographic electron microscopy. 
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The term “BPI fusion protein’* as used herein refers to a polypeptide that comprises 
(i) an amino acid sequence of a BPI, a BPI fragment, a BPI-related polypeptide or a 
fragment of a BPI-related polypeptide and (ii) an amino acid sequence of a heterologous 
polypeptide (4e, a non-BPI, non-BPI fragment or non-BPI-related polypeptide). 

The teim “BPI homolog” as used herein refers to a polypeptide that comprises an 
amino acid sequence similar to that of a BPI but does not necessarily possess a similar or 
identical function as the BPI. 

The term “BPI ortholog” as used herein refers to a non-human polypeptide that (i) 
comprises an amino acid sequence similar to that of a BPI and (ii) possesses a similar or 
identical function to that of the BPI. 

The term “BPI-related polypeptide” as used herein refers to a BPI homolog, an API 
analog, an isoform of BPI, a BPI ortholog, or any combination thereof. 

The term “derivative” as used herein refers to a polypeptide that comprises an amino 
acid sequence of a second polypeptide which has been altered by the introduction of amino 
acid residue substitutions, deletions or additions. The derivative polypeptide possess a 
similar or identical function as the second polypeptide. 

The term “fragment” as used herein refers to a peptide or polypeptide comprising an 
amino acid sequence of at least 5 amino acid residues (preferably, at least 10 amino acid 
residues, at least 15 amino acid residues, at least 20 amino acid residues, at least 25 amino 
acid residues, at least 40 amino acid residues, at least 50 amino acid residues, at least 60 
amino residues, at least 70 amino acid residues, at least 80 amino acid residues, at least 90 
amino acid residues, at least 100 amino acid residues, at least 125 amino acid residues, at 
least 150 amino acid residues, at least 175 amino acid residues, at least 200 amino acid 
residues, or at least 250 amino acid residues) of the amino acid sequence of a second 
polypeptide. The fragment of a BPI may or may not possess a functional activity of the a 
second polypeptide. 

The term “fold change” includes "fold increase" and "fold decrease" and refers to the 
relative increase or decrease in abundance of an BF or the relative increase or decrease in 
expression or activity of a polypeptide (e.g. a BPI) in a first sample or sample set compared 
to a second sample (or sample set). An BF or polypeptide fold change may be measured by 
any technique known to those of skill in the art, however the observed increase or decrease 
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will vary depending upon the technique used. Preferably, fold change is determined herein 
as described in the Examples infra. 

The term “isoform” as used herein refers to variants of a polypeptide that are 
encoded by the same gene, but that differ in their pi or MW; or both. Such isoforms can 
5 differ in their amino acid composition (e.g. as a result of alternative splicing or limited 
proteolysis) and in addition, or in the alternative, may arise from differential post- 
translational modification {e.g., glycosylation, acylation, phosphorylation). 

The term “modulate” when used herein in reference to expression or activity of a BPI 
or a BPI-related polypeptide refers to the upregulation or downregulation of the expression 
10 or activity of the BPI or a BPI-related polypeptide. Based on the present disclosure, such 
modulation can be determined by assays known to those of skill in the art or described 
herein. 

The percent identity of two amino acid sequences or of two nucleic acid sequences is 
determined by aligning the sequences for optimal comparison puiposes {e.g., gaps can be 
1 5 introduced in the first sequence for best alignment with the sequence) and comparing the 
amino acid residues or nucleotides at corresponding positions. The *best alignment is an 
alignment of two sequences which results in the highest percent identity. The percent 
identity is determined by the number of identical amino acid residues or nucleotides in the 
sequences being compared (/. e . , % identity = # of identical positions/total # of positions x 
20 100 ). 

The determination of percent identity between two sequences can be accomplished 
using a mathematical algorithm known to those of skill in the art. An example of a 
mathematical algorithm for comparing two sequences is the algorithm of Karlin and 
Altschul (1990) Proa Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and 
25 Altschul (1993) Proc. Natl. Acad Sci. USA 90:5873-5877. The NBLAST and XBLAST 
programs of Altschul, et al. (1990) J. Mol Biol. 215:403-410 have incorporated such an 
alogrithm. BLAST nucleotide searches can be performed with the NBLAST program, score 
= 100, wordlength = 12 to obtain nucleotide sequences homologous to a nucleic acid 
molecules of the invention. BLAST protein searches can be performed with the XBLAST 
30 program, score ==50, wordlength = 3 to obtain amino acid sequences homologous to a 

protein molecules of the invention. To obtain gapped alignments for comparison purposes, 
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Gapped BLAST can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. 
25:3389-3402. Alternatively, PSI-Blast can be used to perform an iterated search which 
detects distant relationships between molecules (Id.). When utilizing BLAST, Gapped 
BLAST, and PSI-Blast programs, the default parameters of the respective programs (e.g, 
XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. 

Another example of a mathematical algorithm utilized for the comparison of 
sequences is the algorithm ofMyers and Miller, CABIOS (1989). The ALIGN program 
(version 2.0) which is part of the CGC sequence alignment software package has 
incorporated such an alogrithm. Other algorithms for sequence analysis known in the art 
include ADVANCE and ADAM as described in Torellis and Robotti (1994) Comput. Appl. 
Biosci., 10 :3-5; and FASTA described in Pearson and Lipman (1988) Proc. Natl. Acad. Sci. 

55.2444-8. Within FASTA, ktup is a control option that sets the sensitivity and speed of the 
search. 


4 - BRIEF DESCRIPTION OF THE FIOIIRF. 

Figure 1 is an image obtained from 2-dimensional electrophoresis of depleted serum 
representing a combination of normal serum and serum taken from subjects having breast 
cancer, which has been annotated to identify eleven landmark features, designated DS 1 , 
DS2, DS4, DS5, DS6, DS8, DS9, DS10, DS11, DS12, and DS13. 

5 - DETAILED DESCRIPTION OF THF. INVENTION 

The invention described in detail below provides methods and compositions for 
clinical screening, diagnosis and prognosis of breast cancer in a mammalian subject, for 
monitoring the results of breast cancer therapy, identifying patients most likely to respond to 
specific therapeutic treatments, and for drug screening and drug development. The invention 
also encompasses the administration of therapeutic compositions to a mammalian subject to 
treat or prevent breast cancer. The mammalian subject may be a non-human mammal, but is 
preferably human, more preferably a human adult, i.e. a human subject at least 21 (more 
preferably at least 35, at least 50, at least 60, at least 70, or at least 80) years old. For clarity 
of disclosure, and not by way of limitation, the invention will be described with respect to 
the analysis of serum samples. However, as one skilled in the art will appreciate, the assays 



WO 01/13117 


PCT/GB00/03143 


and techniques described below can be applied to other types of samples, including another 
body fluid ( e.g . blood, plasma or saliva), a tissue sample from a subject at risk of having or 
developing breast cancer (e.g. a biopsy such as a breast or lymph node biopsy) or 
homogenate thereof. The methods and compositions of the present invention are useful for 
screening, diagnosis and prognosis of a living subject, but may also be used for postmortem 
diagnosis in a subject, for example, to identify family members of the subject who are at nsk 
of developing the same disease. 

As used herein, serum refers to the supernatant fluid produced by clotting and 
centrifugal sedimentation of a blood sample. 

5.1 Breast Cancer- Associated Features (BFg 

In one aspect of the invention, two-dimensional electrophoresis is used to analyze 
serum from a subject, preferably a living subject, in order to detect or quantify the 
expression of one or more Breast Cancer- Associated Features (BFs) for screening, 
prevention or diagnosis of breast cancer, to determine the prognosis of a subject having 
breast cancer, to monitor progression of breast cancer, to monitor the effectiveness of breast 
cancer therapy, or for drug development. As used herein, "two-dimensional electrophoresis" 
(2D-electrophoresis) means a technique comprising isoelectric focusing, followed by 
denaturing electrophoresis; this generates a two-dimensional gel (2D-gel) containing a 
plurality of separated proteins. Preferably, the step of denaturing electrophoresis uses 
polyacrylamide electrophoresis in the presence of sodium dodecyl sulfate (SDS-PAGE). 
Especially preferred are the highly accurate and automatable methods and apparatus ( 'the 
Preferred Technology") described in International Application No. 97GB3307 (published as 
WO 98/23950) and in U.S. Application No. 08/980,574, both filed December 1 , 1997, each 
of which is incorporated herein by reference in its entirety with particular reference to the 
protocol at pages 23-35. Briefly, the Preferred Technology provides efficient, computer- 
assisted methods and apparatus for identifying, selecting and characterizing biomolecules 
(e.g. proteins, including glycoproteins) in a biological sample. A two-dimensional array is 
generated by separating biomolecules on a two-dimensional gel according to their 
electrophoretic mobility and isoelectric point. A computer-generated digital profile of the 
array is generated, representing the identity, apparent molecular weight, isoelectric point, 
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and relative abundance of a plurality of biomolecules detected in the two-dimensional array, 
thereby permitting computer-mediated comparison of profiles from multiple biological 
samples, as well as computer aided excision of separated proteins of interest. 

A preferred scanner for detecting fluorescently labeled proteins is described in WO 
96/36882 and in the Ph.D. thesis of David A. Basiji, entitled "Development of a High- 
throughput Fluorescence Scanner Employing Internal Reflection Optics and Phase-sensitive 
Detection (Total Internal Reflection, Electrophoresis)", University of Washington (1997), 
Volume 58/12-B of Dissertation Abstracts International, page 6686, the contents of each of 
which are incorporated herein by reference. These documents describe an image scanner 
designed specifically for automated, integrated operation at high speeds. The scanner can 
image gels that have been stained with fluorescent dyes or silver stains, as well as storage 
phosphor screens. The Basiji thesis provides a phase- sensitive detection system for 
discriminating modulated fluorescence from baseline noise due to laser scatter or 
homogeneous fluorescence, but the scanner can also be operated in a non-phase-sensitive 
mode. This phase-sensitive detection capability would increase the sensitivity of the 
instrument by an order of magnitude or more compared to conventional fluorescence 
imaging systems. The increased sensitivity would reduce the sample-preparation load on the 
upstream instruments while the enhanced image quality simplifies image analysis 
downstream in the process. 

A more highly preferred scanner is the Apollo 2 scanner (Oxford Glycosciences, 
Oxford, UK), which is a modified version of the above described scanner. In the Apollo 2 
scanner, the gel is transported through the scanner on a precision lead-screw drive system. 

This is preferable to laying the glass plate on the belt-driven system that is described in the 
Basiji thesis, as it provides a reproducible means of accurately transporting the gel past the 
imaging optics. 

In the Apollo 2 scanner, the gel is secured against three alignment stops that rigidly 
hold the glass plate in a known position. By doing this in conjunction with the above 
precision transport system, the absolute position of the gel can be predicted and recorded. 

This ensures that co-ordinates of each feature on the gel can be determined more accurately 
and communicated, if desired, to a cutting robot for excision of the feature. In the Apollo 2 
scanner, the earner that holds the gel has four integral fluorescent markers for use to correct 
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the image geometry. These markers are a quality control feature that confirms that the 
scanning has been performed correctly. 

In comparison to the scanner described in the Basiji thesis, the optical components of 
the Apollo 2 scanner have been inverted. In the Apollo 2 scanner, the laser, mirror, 

5 waveguide and other optical components are above the glass plate being scanned. The 
scanner described in the Basiji thesis has these components underneath. In the Apollo 2 
scanner, the glass plate is mounted onto the scanner gel side down, so that the optical path 
remains through the glass plate. By doing this, any particles of gel that may break away 
from the glass plate will fall onto the base of the instrument rather than into the optics. This 
1 0 does not affect the functionality of the system, but increases its reliability. 

Still more preferred is the Apollo 3 scanner, in which the signal output is digitized to 
the full 16-bit data without any peak saturation or without square root encoding of the signal. 
A compensation algorithm has also been applied to correct for any variation in detection 
sensitivity along the path of the scanning beam. This variation is due to anomalies in the 
15 optics and differences in collection efficiency across the waveguide. A calibration is 

performed using a perspex plate with an even fluorescence throughout. The data received 
from a scan of this plate are used to determine the multiplication factors needed to increase 
the signal from each pixel level to a target level. These factors are then used in subsequent 
scans of gels to remove any internal optical variations. 

20 As used herein, the term "feature” refers to a spot detected in a 2D gel, and the term 

"Breast Cancer-Associated Feature" (BF) refers to a feature that is differentially present in a 
sample (e.g. a sample of serum) from a subject having breast cancer compared with a sample 
(e.g. a sample of serum) from a subject free from breast cancer. As used herein, a feature (or 
a protein isoform of BPI, as defined infra) is "differentially present" in a first sample with 
25 respect to a second sample when a method for detecting the feature, iso form or BPI (e.g . , 2D 
electrophoresis or an immunoassay) gives a different signal when applied to the first and 
second samples. A feature, isoform or BPI is "increased" in the first sample with respect to 
the second if the method of detection indicates that the feature, isoform or BPI is more 
abundant in the first sample than in the second sample, or if the feature, isoform or BPI is 
30 detectable in the first sample and undetectable in the second sample. Conversely, a feature, 
isoform or BPI is "decreased" in the first sample with respect to the second if the method of 


13 



WO 01/13117 


PCT/GBOO/03143 


detection indicates that the feature, isofom, or BPI is less abundant in the first sample than 

in the second sample or if the feature, isofom, or BPI is undetectable in the first sample and 
detectable in the second sample. 

Preferably, the relative abundance of a feature in two samples is determined in two 
steps. First, the signal obtained upon detecting the feature in a sample is normalized by 
reference to a suitable background parameter, e.g., (a) to the total protein in the sample being 
analyzed (e.g., total protein loaded onto a gel); (b) to an Expression Reference Feature 
(ERF) i.e., a feature whose abundance is invariant, within the limits of variability of the 
Preferred Technology, in the population of subjects being examined, e.g. the ERFs disclosed 
below, or (c) more preferably to the total signal detected from all proteins in the sample. 

Secondly, the normalized signal for the feature in one sample or sample set is 
compared with the normalized signal for the same feature in another sample or sample set in 
order to identify features that are "differentially present" in the first sample (or sample set) 
with respect to the second. 

The BFs disclosed herein have been identified by comparing serum samples from 
subjects having breast cancer against serum samples from subjects free from breast cancer. 
Subjects free from breast cancer include subjects with no known disease or condition 

(normal subjects) and subjects with diseases (including mammary pathologies) other than 
breast cancer. 

Four groups of BFs have been identified through the methods and apparatus of the 
Preferred Technology. The first group consists of BFs that are decreased in the serum of 
subjects having primary breast cancer as compared with the serum of subjects free from 
breast cancer. These BFs can be described by apparent molecular weight (MW) and 
isoelectric point (pi) as provided in Table I. 


TabIeL gFs Decreased In Serum of Subjects H a vine Primarv Rrpac , Cancer 


BF# 

% Feature 
Presence 
(foreground) 

% Feature 

Presence 

(background) 

Fold Change 

Pi 

MW (Da) 

p value 
(Rank-Sum 

test) 

BF-1 

86 

100 

-1.49 

r 7.27 

30450 . 

0.022371 

BF-2 

46 

61 

-1.45 

6.65 

47800 

0.026919 
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BF-3 

100 

100 

-1.44 

7.61 

48250 

0.002279 

BF-4 

80 

100 

-1.41 

5.29 

34070 

0.031476 

BF-5 

100 

100 

-1.33 

4.90 

72090 

0.028553 

BF-7 

93 

100 

-1.32 

4.83 

65170 

0.022503 

BF-8 

100 

100 

-1.31 

5.13 

37100 

0.019952 

BF-9 

100 

100 

-1.26 

5.11 

22910 

0.02864 

BF-10 

100 

100 

-1.25 

4.89 

31960 

0.014466 

BF-12 

100 

100 

-1.24 

4.73 

47250 

0.038099 

BF-13 

100 

100 

-1.23 

5.03 

30780 

0.009829 

BF-14 

100 

100 

-1.22 

6.07 

33400 

0.013643 

BF-42 

94 

93 

-1.36 

4.98 

35440 



The second group consists of BFs that are increased in the serum of subjects having 
primary breast cancer as compared with the serum of subjects free from breast cancer. These 
BFs can be described by MW and pi as follows: 


Table n. BFs Increased in Serum of Subj e cts Having Primary Breast Canc er 


BF# 

% Feature 

Presence 

(foreground) 

% Feature 

Presence 

(background) 

Fold Change 

Pi ’ 

MW (Da) 

p value 
(Rank-Sum 

test) 

BF-15 

66 

46 

2.26 

6.60 

74830 

0.029818 

BF-16 

66 

69 

1.50 

5.74 

35220 

0.027283 

BF-17 

60 

69 

1.25 

6.37 

41260 

0.016833 

BF-18 

* 100 

100 

1.23 

6.20 

67280 

0.022262 

BF-43 

87 

54 

1.23 

6.02 

59410 


BF-44 

67 

85 

1.77 

5.38 

67290 


BF-45 

100 

100 

1.14 

6.15 

191760 




The third group consists of BFs that are decreased in the serum of subjects having metastatic 
10 breast cancer as compared with the serum of subjects free from breast cancer. These BFs can 
be described by MW and pi as follows: 


Table ID. BFs Decreased in Seru m of Subjects Having Metastatic Breast Canc e r 
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BF# 

% Feature 

Presence 

(foreground) 

% Feature 

Presence 

(background] 

Fold Change 

Pi 

I MW (Da) 

p value 
(Rank-Sum 

test) 

BF-19 

44 

84 

-1.91 

5.16 

94860 

0.008122 

BF-20 

83 

76 

-1.83 

5.22 

31160 

0.035008 

BF-22 

72 

92 

-1.79 

6.08 

59520 

0.008979 

BF-23 

100 

92 

-1.66 

7.01 

55950 

0.044225 

BF-26 

72 

84 

-1.51 

5.32 

24490 

0.006342 

BF-27 

100 

100 

-1.45 

5.97 

91410 

0.015438 

BF-28 

100 

100 

-1.35 

5.11 

22910 

0.006103 

BF-29 

r ioo 

100 

r -i.32 

5.26 

20530 

0.047503 

BF-30 

100 

100 

-1.31 

4.79 

47130 

0.029112 

BF-31 

100 

100 

-1.25 

5.15 

73350 

0.032217 

BF-32 

100 

100 

-1.21 

6.51 

51100 

0.010398 

BF-33 

100 

92 

-1.21 

5.35 

81060 

0.034048 

BF-34 

100 — ' 

100 

-1.16 

6.72 

47550 

0.049559 

BF-46 

77 

92 

-1.73 

5.13 

20730 


BF-47 

95 

92 

-1.45 

4.31 

27930 


BF-48 

100 

100 

-1.19 

6.44 

44960 



The fourth group consists of BFs that are increased in the serum of subjects having 
metastatic breast cancer as compared with the serum of subjects free from breast cancer. 
These BFs can be described by MW and pi as follows: 


Table ^ — s ^creased in Serum of Su b jects Having Metastatic Rreact Cancer 


BF# 

% Feature 

Presence 

(foreground) 

% Feature 

Presence 

(background) 

Fold Change 

Pi 

MW (Da) 

p value 
(Rank-Sum 

test) 

BF-35 

44 

53 

1.59 

6.38 

38110 

0.014817 

BF-36 

94 

92 

1.58 

4.51 

51660 

0.025214 

BF-37 

100 

92 

1.54 

4.63 

47200 

0.048935 

BF-38 

66 

92 

1.44 

4.80 

38880 

0.024158 

BF-39 

88 

100 

1.42 

6.20 

67280 

0.029489 
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BF-40 

100 

100 

1.37 

5.34 

16620 

0.007728 

BF-41 

88 

76 

1.02 

5.62 

40830 

0.034673 


For any given BF, the signal obtained upon analyzing serum from subjects having breast 
cancer relative to the signal obtained upon analyzing serum from subjects free from breast 
cancer will depend upon the particular analytical protocol and detection technique that is 
used. Accordingly, the present invention contemplates that each laboratory will, based on 
the present description, establish a reference range for each BF in subjects free from breast 
cancer according to the analytical protocol and detection technique in use, as is conventional 
in the diagnostic art. Preferably, at least one control positive serum sample from a subject 
known to have breast cancer or at least one control negative serum sample from a subject 
known to be free from breast cancer (and more preferably both positive and negative control 
samples) are included in each batch of test samples analyzed. In one embodiment, the level 
of expression of a feature is determined relative to a background value, which is defined as 
the level of signal obtained from a proximal region of the image that (a) is equivalent in area 
to the particular feature in question; and (b) contains no discemable protein feature. 

In a preferred embodiment, the signal associated with an BF in the serum of a subject 
(e.g., a subject suspected of having or known to have breast cancer) is normalized with 
reference to one or more ERFs detected in the same 2D gel. As will be apparent to one of 
ordinary skill in the art, such ERFs may readily be determined by comparing different 
samples using the Preferred Technology. Suitable ERFs include (but are not limited to) that 
described in the following table. 


Table V. Expression Reference Features 


ERF-# 

MW (Dal 

PI 

ERF-1 

53370 

6.17 

| ERF-2 

30780 

5.03 


As those of skill in the art will readily appreciate, the measured MW and pi of a 
given feature or protein isoform will vary to some extent depending on the precise protocol 
used for each step of the 2D electrophoresis and for landmark matching. As used herein, the 
terms "MW" and "pi" are defined, respectively, to mean the apparent molecular weight (in 
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Daltons) and the apparent isoelectric point of a feature or protein isoform as measured in 
exact accordance with the Reference Protocol identified in Section 5 below. When the 
Reference Protocol is followed and when samples are run in duplicate or a higher number of 
replicates, variation in the measured mean pi of an BF or BPI is typically less than 3% and 
5 variation in the measured mean MW of an BF or BPI is typically less than 5%. Where the 
skilled artisan wishes to deviate from the Reference Protocol, calibration experiments should 
be performed to compare the MW and pi for each BF or protein isofoim as detected (a) by 
the Reference Protocol and (b) by the deviant protocol. 

BFs can be used for detection, prognosis, diagnosis, monitoring of breast cancer or 
10 for drug development, or identifying patients most likely to respond to specific therapeutic 
treatments. In one embodiment of the invention, serum from a subject (e.g., a subject 
suspected of having breast cancer) is analyzed by 2D electrophoresis for quantitative 
detection of one or more of the follow ing BFs: BF-1, BF-2, BF-3, BF-4, BF-5, BF-7, BF-8 
BF-9, BF-10, BF-12, BF-1 3, BF-14, BF-42. A decreased abundance ofsaid ole or more ’ 

1 5 BFs in the serum from the subject relative to serum from a subject or subjects free from 

breast cancer (e.g., a control sample or a previously deteimined reference range) indicates 
the presence of primary breast cancer. 

In another embodiment of the invention, serum from a subject is analyzed by 2D 
electrophoresis for quantitative detection of one or more of the following BFs: BF-1 5, BF- 
20 16, BF-1 7, BF-1 8, BF-43, BF-44, BF-45. An increased abundance ofsaid one or more BFs 

in the serum from the subject relative to serum from a subject or subjects free from breast 
cancer (e.g:, a control sample or a previously determined reference range) indicates the 
presence of primary breast cancer. 

In another embodiment, serum from a subject is analyzed for quantitative detection of 

° (3) ° ne ° r m0re BFs ’ or ^ combination of them, whose decreased abundance indicates the 

presence of primary breast cancer, i.e., BF-1, BF-2, BF-3, BF-4, BF-5, BF-7, BF-8, BF-9, 
BF-10, BF-12, BF-13, BF-14, BF-42; and (b) one or more BFs, or any combination of them, 
whose increased abundance indicates the presence of primary breast cancer, i.e.. BF-15, BF-' 

16, BF-17, BF-18, BF-43, BF-44, BF-45. 

30 1,1 another embodiment of the invention, serum from a subject is analyzed by 2D 

electrophoresis for quantitative detection of one or more of the following BFs: BF- 1 9, BF- 
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20, BF-22, BF-23, BF-26, BF-27, BF-28, BF-29, BF-30, BF-31, BF-32, BF-33, BF-34, BF- 
46, BF-47, BF-48. A decreased abundance of said one or more BFs in the serum from the 
subject relative to serum from a subject or subjects free from breast cancer (e.g., a control 
sample or a previously determined reference range) indicates the presence of metastatic 
5 breast cancer. 

In another embodiment of the invention, serum from a subject is analyzed by 2D 
electrophoresis for quantitative detection of one or more of the following BFs: BF-35, BF- 
36, BF-37, BF-38, BF-39, BF-40, BF-41. An increased abundance of said one or more BFs 
in the serum from the subject relative to serum from a subject or subjects free from breast 
10 cancer (e.g., a control sample or a previously determined reference range) indicates the 
presence of metastatic breast cancer. 

In another embodiment, serum from a subject is analyzed for quantitative detection 
of (a) one or more BFs, or any combination of them, whose decreased abundance indicates 
the presence of metastatic breast cancer, i.e., BF-19, BF-20, BF-22, BF-23, BF-26, BF-27, 

15 BF-28, BF-29, BF-30, BF-31, BF-32, BF-33, BF-34, BF-46, BF-47, BF-48; and (b) one or 

more BFs, or any combination of them, whose increased abundance indicates the presence of 
metastatic breast cancer, i.e., BF-35, BF-36, BF-37, BF-38, BF-39, BF-40, BF-41. 

One skilled in the art can readily see that by comparing suitable combinations of 
BFs, it will be possible to differentially diagnose primary versus metastatic breast cancer. 

20 In a further embodiment, serum from a subject is analyzed for quantitative detection 

of (a) one or more BFs, or any combination of them, whose decreased abundance indicates 
the presence of breast cancer, i.e., BF-1, BF-2, BF-3, BF-4, BF-5, BF-7, BF-8, BF-9, BF-10, 
BF-12, BF-13, BF-14, BF-19, BF-20, BF-22, BF-23, BF-26, BF-27, BF-28, BF-29, BF-30, 
BF-31, BF-32, BF-33, BF-34, BF-42, BF-46, BF-47, BF-48; and (b) one or more BFs, or 
25 any combination of them, whose increased abundance indicates the presence of breast 

cancer, i.e. BF-15, BF-16, BF-17, BF-18, BF-35, BF-36, BF-37, BF-38, BF-39, BF-40, BF- 
41, BF-43, BF-44, BF-45. 

In yet another embodiment of the invention, serum from a subject is analyzed by 2D 
electrophoresis for quantitative detection of one or more of the following BFs: BF-1, BF-2, 
30 BF-3, BF-4, BF-5, BF-7, BF-8, BF-9, BF-10, BF-12, BF-13, BF-14, BF-15, BF-16, BF-17, 

BF-18, BF-19, BF-20, BF-22, BF-23, BF-26, BF-27, BF-28, BF-29, BF-30, BF-31, BF-32, 
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BF-33, BF-34, BF-35, BF-36, BF-37, BF-38, BF-39, BF-40, BF-41, BF-42, BF-43, BF-44. 
BF-45, BF-46, BF-47, BF-48 wherein the ratio of the one or more BFs relative to an 
Expression Reference Feature (ERF) indicates whether breast cancer is present. 

In a specific embodiment, a decrease in one or more BF/ERF ratios in a test sample 
relative to the BF/ERF ratios in a control sample or a reference range indicates the presence 
of primary breast cancer; BF-1, BF-2, BF-3, BF-4, BF-5, BF-7, BF-8, BF-9, BF-10, BF-12, 
BF-13, BF-14, BF-42 are suitable BFs for this purpose. In another specific embodiment, an 
increase in one or more BF/ERF ratios in a test sample relative to the BF/ERF ratios in a 
control sample or a reference range indicates the presence of primary breast cancer; BF-1 5, 
BF-1 6, BF-17, BF-1 8, BF-43, BF-44, BF-45 are suitable BFs for this puipose. 

In a further specific embodiment, serum from a subject is analyzed by 2D 
electrophoresis for quantitative detection of (a) one or more BFs, or any combination of 
them, whose decreased BF/ERF ratio(/s) in a test sample relative to the BF/ERF rations) in a 
control sample indicates the presence of primary breast cancer, i.e., BF-1, BF-2, BF-3, BF-4, 
BF-5, BF-7, BF-8, BF-9, BF-10, BF-12, BF-13, BF-14, BF-42; and (b) one or more BFs, or 
any combination of them, whose increased BF/ERF rations) in a test sample relative to the 
BF/ERF ratio(/s) in a control sample indicates the presence of primary breast cancer, i.e., 
BF-1 5, BF-1 6, BF-17, BF-18, BF-43, BF-44, BF-45. 

In a specific embodiment, a decrease in one or more BF/ERF ratios in a test sample 
relative to the BF/ERF ratios in a control sample or a reference range indicates the presence 
of metastatic breast cancer; BF-1 9, BF-20, BF-22, BF-23, BF-26, BF-27, BF-28, BF-29, BF- 
30, BF-31, BF-32, BF-33, BF-34, BF-46, BF-47, BF-48 are suitable BFs for this puipose. In 
another specific embodiment, an increase in one or more BF/ERF ratios in a test sample 
relative to the BF/ERF ratios in a control sample or a reference range indicates the presence 
of metastatic breast cancer; BF-35, BF-36, BF-37, BF-38, BF-39, BF-40, BF-41 are suitable 
BFs for this purpose. 

In a further specific embodiment, serum from a subject is analyzed by 2D 
electrophoresis for quantitative detection of (a) one or more BFs, or any combination of. 
them, whose decreased BF/ERF rations) in a test sample relative to the BF/ERF rations) in a 
control sample indicates the presence of metastatic breast cancer, i.e., BF-19, BF-20, BF-22, 
BF-23, BF-26, BF-27, BF-28, BF-29, BF-30, BF-31, BF-32, BF-33, BF-34, BF-46, BF-47, 
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BF-48; and (b) one or more BFs, or any combination of them, whose increased BF/ERF 
rations) in a test sample relative to the BF/ERF rations) in a control sample indicates the 
presence of metastatic breast cancer, i.e, BF-35, BF-36, BF-37, BF-38, BF-39, BF-40, BF- 
41. 

In a further specific embodiment, serum from a subject is analyzed by 2D 
electrophoresis for quantitative detection of (a) one or more BFs, or any combination of 
them, whose decreased BF/ERF rations) in a test sample relative to the BF/ERF rations) in a 
control sample indicates the presence of breast cancer, i.e., BF-1, BF-2, BF-3, BF-4, BF-5, 
BF-7, BF-8, BF-9, BF-10, BF-12, BF-13, BF-14, BF-19, BF-20, BF-22, BF-23, BF-26, BF- 
27, BF-28, BF-29, BF-30, BF-31, BF-32, BF-33, BF-34, BF-42, BF-46, BF-47, BF-48; and 
(b) one or more BFs, or any combination of them, whose increased BF/ERF ratio(/s) in a test 
sample relative to the BF/ERF rations) in a control sample indicates the presence of breast 
cancer, Le., BF-15, BF-16, BF-17, BF-18, BF-35, BF-36, BF-37, BF-38, BF-39, BF-40, BF- 
41, BF-43, BF-44, BF-45. 

In a preferred embodiment, serum from a subject is analyzed for quantitative 
detection of a plurality of BFs. 

5.2 Breast Cancer- Associated Protein Isoforms (BPIs) 

In another aspect of the invention, serum from a subject, preferably a living subject, 
is analyzed for quantitative detection of one or more Breast Cancer- Associated Protein 
Isoforms (BPIs) for screening or diagnosis of breast cancer, to determine the prognosis of a 
subject having breast cancer, to monitor the effectiveness of breast cancer therapy, or for 
drug development, or for identifying patients most likely to respond to a particular 
therapeutic treatment. As is well known in the art, a given protein may be expressed as 
variants (isoforms) that differ in their amino acid composition (e.g., as a result of alternative 
splicing or limited proteolysis) or as a result of differential post-translational modification 
(e.g., glycosylation, phosphorylation, acylation), or both, so that proteins of identical amino 
acid sequence can differ in their pi, MW, or both. It follows that differential presence of a 
protein isofonn does not require differential expression of the gene encoding the protein in 
question. As used herein, the term "Breast Cancer- Associated Protein Isoforni" refers to a 
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protein isoform that is differentially present in serum from a subject having breast cancer 
compared with serum from a subject free from breast cancer. 

Four groups of BPIs have been identified by partial amino acid sequencing of BFs, 
using the methods and apparatus of the Preferred Technology. The first group consists of 
BPIs that are decreased in the serum of subjects having primary breast cancer as compared 
with the serum of subjects free from breast cancer, where the differential presence is 
significant. The partial amino acid sequences identified by tandem mass spectrometry for 
these BPIs are listed in Table VI. For each BPI, a list of accession numbers of protein 
sequences is given, each of which incoiporates all partial amino acid sequences identified for 
the BPI. For some BPIs, the partial sequence information derived from tandem mass 
spectrometry was not found to be described in any known public database. These are listed 
as 'NOVEL' in Table VI, and the partial amino acid sequence information for these BPIs is 
given in in Table XII. 


Jable VI. BPIs Decreased in Serum of Subjects having Primary Breast Cancer 


BF# 

1 BPI# 

Amino Acid Sequences from 
Tandem Mass Spectrometry 

Accession Numbers of Identified 
Sequences* 

BF-1 

BPM 

CSVFYGAPSK 

VEYGFQVK 

FACYYPR 

1 16602 (gb) P01028 (SWISS-PROT) 
179674 (gb) 

2347136 (gb) 

443671 (gb) 

BF-1 

BPI-50 

See Table XII 

NOVEL “j 

BF-5 

BPI-5 

QEDDLANINQWVK 

LCQDLGPGAFR 

1 12907 (gb) P08697 (SWISS-PROT) 
178751 (gb) 

219410 (gb) 

BF-5 

BPI-6 

WLQGSQELPR 

223099 (gb) 

229585 (gb) 

223069 (gb) 

229537 (gb) 

113585 (gb) P01877 (SWISS-PROT) 
2135473 (gb) 

87783 (gb) 

70058 (gb) 

2190501 (gb) 

2190363 (gb) 

86666 (gb) 

113583 (gb) P20758 (SWISS-PROT) 
184749 (gb) 
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113584 (gb) P01876 (SWISS-PROT) 
3201900 (gb) 

2160055 (gb) 

2160054 (gb) 

BF-5 

BP 1-40 

QSLEASLAETEGR 

623409 (gb) 

88042 (gb) 

307086 (gb) 

547749 (gb) PI 3645 (SWISS-PROT) 
71528 (gb) 

186629 (gb) 

BF-9 

BPI-9 

AKPALEDLR 

ATEHLSTLSEK 

THLAPYSDELR 

VSFLSALEEYTK 

VQPYLDDFQK 

178775 (gb) 

113992 (gb) P02647 (SWISS-PROT) 
178777 (gb) 

229479 (gb) 

BF-10 

BPI-11 

SEIDLFNIR 

GLGTDEESILTLLTSR 

GAGTDDHTLIR 

113960 (gb) P08758 (SWISS-PROT) 
809185 (gb) 

BF-10 

BPI-10 

ETLLQDFR 

122801 (gb) P02760 (SWISS-PROT) 
223373 (gb) 

BF-12 

BPI-12 

TEQWSTLPPETK 

VLSLAQEQVGGSPEK 

QGSFQGGFR 

ADGSYAAWLSR 

AEMADQAAAWLTR 

179674 (gb)2347136 (gb) 

187771 (gb) - 
223961 (gb) 

223962 (gb) 

BF-13 

BPI-13 

ETLLQDFR 

122801 (gb) P02760 (SWISS-PROT) 
223373 (gb) 

BF-14 

BPI-14 

YGIDWASGR 

TFAHYATFR 

LLGEVDHYQLALGK 

GEPGDPVNLLR 

QDGSVDFFR 

3413516 (gb) 

BF-14 

BPI-53 

See Table XII 

NOVEL 


BF-42 BPM1 See Table XII NOVEL 

> Accession riumbere of sequences identified from. the GenBank database (described in Buries, et aT GenBank: Current 
Status and Future Directions, Methods in Enzymology 1 83:3 (1990)) are indicated with (gb) or (GBI) following the 
accession number . Where a corresponding sequence entry has been identified in the SWISS-PROT database ^ dc ^ lb ^ !" 
Bairoch et al. The SWISS-PROT protein sequence data bank, recent developments , Nucleic Acids Research, 2 1 : 3093-3096 
5 ( 1 993)) the accession number of the SWISS-PROT entry is also given alongside the accession number for the 

corresponding GenBank entry. 


The second group comprises BPIs that are increased in the serum of subjects having primary 
breast cancer as compared with the serum of subjects free from breast cancer, where the 
10 differential presence is significant. The partial amino acid sequences identified by tandem 
mass spectrometry for these BPIs are listed in Table VIL For each BPI, a list of accession 
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numbers of protein sequences is given, each of which incoiporates all partial amino acid 
sequences identified for the BPI. For some BPIs, the partial sequence information derived 
from tandem mass spectrometry was not found to be described in any known public 
database. These are listed as "NOVEL' in Table VII, and the partial amino acid sequence 
information for these BPIs is given in in Table XII. 


BF# 

J BPI# 

— — V.VUUVU Hi LJV/1U1I1 Ui J 

Amino Acid Sequences from 
Tandem Mass Spectrometry 

uoiecis naving rnmarv Breast Cancer 

Accession Numbers of Identified 
Sequences* 

BF-17 

BPI-54 

See Table XII 

NOVEL ~ 

BF-18 

BPI-55 

iSee Table XII 

NOVEL 

BF-43 

BPI-42 

See Table XII 

NOVEL " 

BF-44 

DC A C 

BPM3 

See Table XII 

NOVEL 

br-4o 

erw | see Table XII 

NOVEL ~ 


The third group comprises BPIs that are decreased in the serum of subjects having metastatic 
10 breast cancer as compared with the serum of subjects free from breast cancer, where the 

differential presence is significant. The partial amino acid sequences identified by tandem 
mass spectromehy for these BPIs are listed in Table VIII. For each BPI, a list of accession 
numbers of protein sequences is given, each of which incoiporates all partial amino acid 
sequences identified for the BPI. For some BPIs, the partial sequence inhumation derived 
1 5 from tandem mass spectromehy was not found to be described in any known public 

database. These are listed as NOVEL' in Table VIII, and the partial amino acid sequence 
information for these BPIs is given in in Table XII. 


Table VM. B PIs Decreased In Serum of Subjec ts Having Metastatic Breast Cancer 


BF# 

BPI# 

Amino Acid Sequences from 
Tandem Mass Spectrometry 

xrrtviim meiasianc tfreast Lancer 

Accession Numbers of Identified 
Sequences* 

BF-19 

BP 1-19 

NGVAQEPVHLDSPAIK 

ATWSGAVLAGR 

CEGPIPDVTFELLR 

CLAPLEGAR 

HQFLLTGDTQGR 

LELHVDGPPPRPQLR 

112892 (gb) P04217 (SWISS-PROT) 




339685 (gb) 
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BF-20 

BPI-21 

GSPAINVAVHVFR 

443295 (gb) 

1181952 (gb) 

136464 (gb) P02766 (SWISS-PROT) 
443297 (gb) 

1336728 (gb) 

4261798 (gb) 

BF-20 

BPI-20 

AKPALEDLR 

DEPPQSPWDR ATEHLSTLSEK 
THLAPYSDELR 

VQPYLDDFQK 

229479 (gb) 

1 78775 (gb) 

113992 (gb) P02647 (SWISS-PROT) 
178777 (gb) 

BF-22 

BPI-49 

See Table XII 

NOVEL 

BF-23 

BP 1-24 

ATWYQGER 

543826 (gb) P02749 (SWISS- 
PROT)31 9918 (gb) 

BF-23 

BP 1-23 

LEQEIATYR 

547750 (gb) P35900 (SWISS-PROT) 
542923 (gb) 

2119209 (gb) 

386803 (gb) 

417200 (gb) P08727 (SWISS-PROT) 
125081 (gb) P19012 (SWISS-PROT) 
125077 (gb) PI 3646 (SWISS-PROT) 
3603253 (gb) 

1708589 (gb) P30654 (SWISS-PROT) 
88057 (gb) 

632732 (gb) 

4321795 (gb) 

1363944 (gb) 

1346342 (gb) P08779 (SWISS-PROT) 
88047 (gb) . 

547751 (gb) Q04695 (SWISS-PROT) 
87774 (gb) 

125080 (gb) P02533 (SWISS-PROT) 
177139 (gb) 

BF-23 

BPI-25 

QDGSVDFGR 

IRPFFPQQ 

LESDVSAQMEYCR 

EDGGGWWYNR 

DNDGWLTSDPR 

182430 (gb) 

399492 (gb) P02675 (SWISS-PROT) 
484509 (gb) 

223002 (gb) 

BF-27 

BPI-27 

EPGLQIWR 

HWPNEWVQR 

121 1 16 (gb) P06396 (SWISS-PROT)' 

BF-27 

BPI-51 

See Table XII 

NOVEL 

BF-28 

BPI-28 

AKPALEDLR 

ATEHLSTLSEK 

THLAPYSDELR 

VSFLSALEEYTK VQPYLDDFQK 

1 78775 (gb) 

113992 (gb) P02647 (SWISS-PROT) 
178777 (gb) 

229479 (gb) 

BF-29 

BPI-29 

LIVHNGYCDGR 

QEELCLAR 

132404 (gb) P02753 (SWISS-PROT) 
88364 (gb) 
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FSGTWYAMAK 

YWGVASFLQK 


BF-30 

BP 1-52 

See Table XII 

NOVEL 

BF-31 

BP 1-31 

NGVAQEPVHLDSPAIK 

SGLSTGWTQLSK 

ATWSGAVLAGR 

CLAPLEGAR 

HQFLLTGDTQGR 

LETPDFQLFK 

1 12892 (gb) P04217 (SWISS-PROT) 

BF-32 

BPI-32 

GECQAEGVLFFQGDR 

VWVYPPEK 

DYFMPCPGR 

YYCFQGNQFLR 

386789 (gb) 

1335098 (gb) 

1708182 (gb) P02790 (SWISS-PROT) 

BF-33 

BP 1-33 

ANVFVQLPR 

TFTPQPPGLER 

LEALPNSLLAPLGR 

LAELPADALGPLQR 

NLPEQVFR 

[543800 (gb) P35858 (SWISS-PROT) 

BF-34 

BPI-34 

DYFMPCPGR 

386789 (gb) 

1335098 (gb) 

1708182 (gb) P02790 (SWISS-PROT) 

BF-34 

BPI-56 

See Table XII 

NOVEL 

BF-46 

BPM5 

See Table XII 

NOVEL 

BF-47 

BPI-46 

See Table XII 

NOVEL 

BF-48 

* A ^('ACfinn ni 

BPI-47 

See Table XII 

NOVEL 


1 — — * ,w “' M,v '-vi.ucuijv uowuosc DiuKs, ci ai. t jentsanK : Current 

Status and Future Directions. Methods in Enzymology 183:3 (1990)) are indicated with (gb) or (GBI) following the 
accession number Where a corresponding sequence entry has been identified in the SWISS-PROT database (described in 
Bairoch et al. The SWISS-PROT protein sequence data bank, recent developments > Nucleic Acids Research 21-3093-3096 
(1993)) the accession number of the SWISS-PROT entry is also given alongside the accession number for the 
corresponding GenBank entry. 


The fourth group comprises BPIs that are increased in the serum of subjects having 
metastatic breast cancer as compared with the serum of subjects free from breast cancer, 
where the differential presence is significant. The partial amino acid sequences identified by 
tandem mass spectrometry for these BPIs are listed in Table IX. For each BPI, a list of 
accession numbers of protein sequences is given, each of which incorporates all partial 
amino acid sequences identified for the BPI. For some BPIs, the partial sequence 
information derived from tandem mass spectrometry was not found to be described in any 
known public database. These are listed as hJOVEL 1 in Table EX, and the partial amino acid 
sequence information for these BPIs is given in in Table XII. 
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Table IX. BPIs Increased In Serum of Subjects Having Metastatic Breast Cancer 


5 


10 


15 


20 


25 


BF# 

BPI# 

Amino Acid Sequences from 
Tandem Mass Spectrometry 

Accession Numbers of Identified 
Sequences* 

BF-37 

BPI-37 

ALGHLDLSGNR 

VAAGAFQGLR 

YLFLNGNK 

ENQLEVLEVSWLHGLK 

112908 (gb) P02750 (SWISS-PROT) 

BF-40 

BPI-48 

See Table XII 

NOVEL 


♦Accession numbers of sequences identified from the GenBank database (described in Burks, et al. GenBank: Current 
Status and Future Directions, Methods in Enzymology 183:3(1 990)) are indicated with (gb) or (GBI) following the 
accession number . Where a corresponding sequence entry has been identified in the SWISS-PROT database (described in 
Bairoch et al. The SWISS-PROT protein sequence data bank, recent developments , Nucleic Acids Research, 21 :3093-3096 
(1993)) the accession number of the SWISS-PROT entry is also given alongside the accession number for the 
corresponding GenBank entry. 


As will be evident to one of skill in the art, based upon the present description, a 
given BPI can be described according to the data provided for that BPI in Table VI, VH, 

Vm or DC. The BPI is a protein comprising a peptide sequence described for that BPI 
(preferably comprising a plurality of, more preferably all of, the peptide sequences described 
for that BPI) and has a pi of about the value stated for that BPI (preferably within 10%, more 
preferably within 5% still more preferably within 1% of the stated value) and has a MW of 
about the value stated for that BPI (preferably within 10%, more preferably within 5%, still 
more preferably within 1% of the stated value). 

In one embodiment, serum from a subject is analyzed for quantitative detection of 
one or more of the following BPIs: BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, BPI-11, BPI-12, 
BPI- 13, BPI- 14, BPI-40, BPI-41, BPI-50, BPI-53 or any combination of them, wherein a 
decreased abundance of the BPI or BPIs (or any combination of them) in the serum from the 
subject relative to serum from a subject or subjects free from breast cancer (e.g., a control 
sample or a previously determined reference range) indicates the presence of primary breast 
cancer. 

s 

In another embodiment of the invention, serum from a subject is analyzed for 
quantitative detection of one or more of the following BPIs: BPI-42, BPI-43, BPI-44, BPI- 
54, BPI-55 or any combination of them, wherein an increased abundance of the BPI or BPIs 
(or any combination of them) in serum from the subject relative to serum from a subject or 
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subjects free from breast cancer ( e.g a control sample or a previously determined reference 
range) indicates the presence of primary breast cancer. 

In another embodiment, serum from a subject is analyzed for quantitative detection of 
(a) one or more BPIs, or any combination of them, whose decreased abundance indicates the 
presence of primary breast cancer, i.e., BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, BPI-1 1, BPI-12, 
BPI-13, BPI-1 4, BPI-40, BPI-41, BPI-50, BPI-53; and (b) one or more BPIs, or any 
combination of them, whose increased abundance indicates the presence of primary breast 
cancer, i.e, BPI-42, BPI-43, BPI-44, BPI-54, BPI-55. 

In another embodiment of the invention, serum from a subject is analyzed for 
quantitative detection of one or more of the following BPIs: BPI-1 9, BPI-20, BPI-21, BPI- 
23, BPI-24, BPI-25, BPI-27, BPI-28, BPI-29, BPI-31, BPI-32, BPI-33, BPI-34, BPI-45, 
BPI-46, BPI-47, BPI-49, BPI-51, BPI-52, BPI-56 or any combination of them, wherein an 
decreased abundance of the BPI or BPIs (or any combination of them) in serum from the 
subject relative to serum from a subject or subjects free from breast cancer (e.g., a control 

sample or a previously determined reference range) indicates the presence of metastatic 
breast cancer. 

In another embodiment of the invention, serum from a subject is analyzed for 
quantitative detection of one or more of the following BPI : BPI-37, BPI-48 wherein an 
increased abundance of the BPI in serum from the subject relative to serum from a subject or 
subjects free from breast cancer (e.g., a control sample or a previously determined reference 
range) indicates the presence of metastatic breast cancer. 

In another embodiment, serum from a subject is analyzed for quantitative detection of 
(a) one or more BPIs, or any combination of them, whose decreased abundance indicates the 
presence of metastatic breast cancer, i.e., BPI-19, BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, 
BPI-27, BPI-28, BPI-29, BPI-31, BPI-32, BPI-33, BPI-34, BPI-45, BPI-46, BPI-47, BPI-49, 
BPI-51, BPI-52, BPI-56; and (b) one or more BPIs, or any combination of them, whose 
increased abundance indicates the presence of metastatic breast cancer, i.e., BPI-37, BPI-48. 

One skilled in the art can readily see that, by comparing a suitable combination of 
BPIs, it is possible to differentially diagnose primary versus metastatic breast cancer. 

In a further embodiment, serum from a subject is analyzed for quantitative detection 
of (a) one or more BPIs, or any combination of them, whose decreased abundance indicates 
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the presence of breast cancer, i.e., BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, BPI-1 1, BPI-12, 
BPI-13, BPI-1 4, BPI-19, BPI-20, BPI-21, BPI-23, BP1-24, BPI-25, BPI-27, BPI-28, BPI-29, 
BPI-31, BPI-32, BPI-33, BPI-34, BPI-40, BPI-41, BPI-45, BPI-46, BPI-47, BPM9, BPI-50, 
BPI-5 1, BPI-52, BPI-53, BPI-56; and (b) one or more BPIs, or any combination of them, 

5 whose increased abundance indicates the presence of breast cancer, z.e.,BPI-37, BPI-42, 
BPI-43, BPI-44, BPI-48, BPI-54, BPI-55. 

In yet a further embodiment, serum from a subject is analyzed for quantitative 
detection of one or more BPIs and one or more previously known biomarkers of breast 
cancer (e.g., shed c-erb-B2 fragment Payne et al. 2000, Clin. Chem. 46:175-182). In 
10 accordance with this embodiment, the abundance of each BPI and known biomarker relative 
to a control or reference range indicates whether a subject has breast cancer. 

Preferably, the abundance of a BPI is normalized to an Expression Reference Protein 
Isoform (ERPI). ERPIs can be identified by partial amino acid sequencing of ERFs, which 
are described above, using the methods and apparatus of the Preferred Technology. The 
15 partial amino acid sequences of an ERPI, and the known proteins to which it is homologous 
is presented in Table X. 


20 


25 


Table X 


ERPI-# 

ERF-# 

Accession Numbers of Identified 
Sequences* 

Amino Acid Sequences from 
Tandem Mass Spectrometry 

ERPI-1 

ERF-2 

122801 (gb) P02760 (SwissProt) 

223373 (gb) 

frnm ihp npnRank database fdes 

ETLLQDFR 

cribed in Burks, et al. GenBank: Current 


Status and Future Directions. Methods in Enzymology 1 83:3 (1990)) are indicated wtth (gb) or (GBI) following the 
accession number . Where a corresponding sequence entry has been identified in the SWISS-PROT database (descnbedm 
Bairoch et al. The SWISS-PROT protein sequence data bank, recent developments , Nucleic Acids Research, 21 :3093-3096 
(1993)) the accession number of the SWISS-PROT entry is also given alongside the accession number for the 
corresponding GenBank entry. 


As shown above, the BPIs described herein include previously unknown proteins, as well as 
isoforms of known proteins where the isoforms were not previously known to be associated 
with breast cancer. For each BPI, the present invention additionally provides: (a) a 
preparation comprising the isolated BPI; (b) a preparation comprising one or more fragments 
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of the BPI; and (c) antibodies that bind to said BPI, to said fragments, or both to said BPI 
and to said fragments. As used herein, a BPI is "isolated" when it is present in a preparation 
that is substantially free of contaminating proteins, i.e., a preparation in which less than 10% 
(preferably less than 5%, more preferably less than 1%) of the total protein present is 
contaminating protein(s). A contaminating protein is a protein or protein isoform having a 
significantly different pi or MW from those of the isolated BPI, as determined by 2D 
electrophoresis. As used herein, a "significantly different" pi or MW is one that permits the 
contaminating protein to be resolved from the BPI on 2D electrophoresis, performed 
according to the Reference Protocol. 

In one embodiment, an isolated protein is provided, said protein comprising a peptide 
with the amino acid sequence identified in Table VI, VII, Vm or IX for a BPI, said protein 
having a pi and MW within 10% (preferably within 5%, more preferably within 1%) of the 
values identified in Table I, II, III or IV for that BPI. 

The BPIs of the invention can be qualitatively or quantitatively detected by any 
method known to those skilled in the art, including but not limited to the Preferred 
Technology described herein, kinase assays, immunoassays, and western blotting. In one 
embodiment, the BPIs are separated on a 2-D gel by virtue of their MWs and pis and 
visualized by staining the gel. In one embodiment, the BPIs are stained with a fluorescent 
dye and imaged with a fluorescence scanner. Sypro Red (Molecular Probes, Inc., Eugene, 
Oregon) is a suitable dye for this puipose. Alternative dyes are described in USSN 
09/412,168, filed October 5 1999, and incorporated herein by reference in its entirety. 

Alternatively, BPIs can be detected in an immunoassay. In one embodiment, an 
immunoassay is performed by contacting a sample from a subject to be tested with an anti- 
BPI antibody under conditions such that immunospecific binding can occur if the BPI is 
present, and detecting or measuring the amount of any immunospecific binding by the 
antibody. Anti-BPI antibodies can be produced by the methods and techniques taught 
herein; examples of such antibodies known in the art are set forth in Table XI. These 
antibodies shown in Table XI are already known to bind to the protein of which the BPI is 
itself a family member. Preferably, the anti-BPI antibody preferentially binds to the BPI 
rather than to other isoforms of the same protein. In a preferred embodiment, the anti-BPI 
antibody binds to the BPI with at least 2-fold greater affinity, more preferably at least 5-fold 



WO 01/13117 


PCT/GB00/03143 


greater affinity, still more preferably at least 10-fold greater affinity, than to said other 
isoforms of the same protein. When the antibodies shown in Table XI do not display the 
required preferential selectivity for the target BPI, one skilled in the art can generate 
additional antibodies by using the BPI itself for the generation of such antibodies. 

5 BPIs can be transferred from the gel to a suitable membrane (e.g. a PVDF 

membrane) and subsequently probed in suitable assays that include, without limitation, 
competitive and non-competitive assay systems using techniques such as western blots and 
"sandwich" immunoassays using anti-BPI antibodies as described herein, e.g., the antibodies 
identified in Table XI, or others raised against the BPIs of interest. The immunoblots can be 
1 0 used to identify those anti-BPI antibodies displaying the selectivity required to immuno- 
specifically differentiate a BPI from other isoforms encoded by the same gene. 


Table XI. Known Antibodies That Recognize BPIs or BPI-Related Polypeptides 


Protein family 
of which BPi 
is a member 

Antibody 

Manufacturer 

Cat. No. 

BPI-5 

alpha-2-antipiasmin 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

YN-RHAPL 

BPI-10 

annexin v (lipcortin v) 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

YM-9020 

BPI-11 

alpha-1 -microglobulin 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

UCB- 

A750/R1H/1 

BPI-13 

alpha-1-microglobuiin 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

UCB- 

A750/R1H/1 

BPI-21 

transthyretin 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

AXL- 125/2 

BPI-23 

keratin 16 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

MED-CLA 

194 

BPI-24 

beta-2-glycoprotein 1 precursor 
(apolipoprotein H) 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

ACL-20020A 

BPI-25 

human fibrinogen beta-chain 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

M22090M 

BPI-27 

gelsolin precursor, plasma (actin- 
depolymerizing factor) 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

RDI- 

IGFBP2abr 

BPI-29 

plasma retinoi-binding protein 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

RDI- 

CLUSTRCab 

G 

BPI-32 

hemopexin precursor 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

BY A- 6019-1 
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BPI-33 

insulin-like growth factor binding 
protein complex acid labile chain 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

BMD- D22 

BPI-34 

hemopexin precursor 

ACCURATE CHEMICAL & 
SCIENTIFIC CORPORATION 

AXL- 574 


In one embodiment, binding of antibody in tissue sections can be used to detect 
aberrant BPI localization or an aberrant level of one or more BPIs. In a specific 
embodiment, antibody to a BPI can.be used to assay a tissue sample (e.g., a breast biopsy) 
from a subject for the level of the BPI where an aberrant level of BPI is indicative of breast 
cancer. As used herein, an "aberrant level" means a level that is increased or decreased 
compared with the level in a subject free from breast cancer or a reference level. If desired, 
the comparison can be performed with a matched sample from the same subject, taken from 
a portion of the body not affected by breast cancer. 

Any suitable immunoassay can be used, including, without limitation, competitive 
and non-competitive assay systems using techniques such as western blots, 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), "sandwich" 
immunoassays, immunoprecipitation assays, precipitin reactions, gel diffusion precipitin 
reactions, immunodiffusion assays, agglutination assays, complement-fixation assays, 
immunoradiometric assays, fluorescent immunoassays and protein A immunoassays. 

For example, a BPI can be detected in a fluid sample (e.g., serum or plasma, CSF, 
blood, unne, or tissue homogenate) by means of a two-step sandwich assay. In the first step, 
a capture reagent (e.g., an anti-API antibody) is used to capture the BPI. Examples of such 
antibodies known in the art are set forth in Table XI. The capture reagent can optionally be 
immobilized on a solid phase. In the second step, a directly or indirectly labeled detection 
reagent is used to detect the captured BPI. In one embodiment, the detection reagent is a 
lectin. Any lectin can be used for this purpose that preferentially binds to the BPI rather 
than to other isoforms that have the same core protein as the BPI or to other proteins that 
share the antigenic determinant recognized by the antibody. In a preferred embodiment, the 
chosen lectin binds to the BPI with at least 2-fold greater affinity, more preferably at least 5- 
fold greater affinity, still more preferably at least 10-fold greater affinity, than to said other 
isoforms that have the same core protein as the BPI or to said other proteins that share the 
antigenic determinant recognized by the antibody. Based on the present description, a lectin 
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that is suitable for detecting a given BPI can readily be identified by methods well known in 
the art, for instance upon testing one or more lectins enumerated in Table 1 on pages 158-159 
of Sumar et al., Lectins as Indicators of Disease- Associated Glycoforms, In: Gabius H-J & 
Gabius S (eds.), 1993, Lectins and Glycobiology, at pp. 158-174 (which is incorporated 
5 herein by reference in its entirety). Lectins with the desired oligosaccharide specificity can 
be identified, for example, by their ability to detect the BPI in a 2D gel, in a replica of a 2D 
gel following transfer to a suitable solid substrate such as a nitrocellulose membrane, or in a 
two-step assay following capture by an antibody. In an alternative embodiment, the 
detection reagent is an antibody, e.g., an antibody that immunospecifically detects other 
10 post-translational modifications, such as an antibody that immunospecifically binds to 
phosphorylated amino acids. Examples of such antibodies include those that bind to 
phosphotyrosine (BD Transduction Laboratories, catalog nos.: PI 1230-050/PI 1230-1 50; 

PI 1 120; P38820; P39020), those that bind to phosphoserine (Zymed Laboratories Inc., 

South San Francisco, CA, catalog no. 61-8100) and those that bind to phosphothreonine 
15 (Zymed Laboratories Inc., South San Francisco, CA, catalog nos. 71-8200, 13-9200). 

If desired, a gene encoding a BPI, a related gene, or related nucleic acid sequences or 
subsequences, including complementary sequences, can also be used in hybridization assays. 
A nucleotide encoding a BPI, or subsequences thereof comprising at least 8 nucleotides, 
preferably at least 12 nucleotides, and most preferably at least 15 nucleotides can be used as 
20 a hybridization probe. Hybridization assays can be used for detection, prognosis, diagnosis, 
or monitoring of conditions, disorders, or disease states, associated with aberrant expression 
of genes encoding BPIs, or for differential diagnosis of subjects with signs or symptoms 
suggestive of breast cancer. In particular, such a hybridization assay can be earned out by a 
method comprising contacting a subject’s sample containing nucleic acid with a nucleic acid 
25 probe capable of hybridizing to a DNA or RNA that encodes a BPI, under conditions such 
that hybridization can occur, and detecting or measuring any resulting hybridization. 
Nucleotides can be used for therapy of subjects having breast cancer, as described below. 

The invention also provides diagnostic kits, comprising an anti-BPI antibody. In 
addition, such a kit may optionally comprise one or more of the following: (1) instructions 
30 for using the anti-BPI antibody for diagnosis, prognosis, therapeutic monitoring or any 

combination of these applications; (2) a labeled binding partner to the antibody; (3) a solid 
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phase (such as a reagent strip) upon which the anti-BPI antibody is immobilized; and (4) a 
label or insert indicating regulatory approval for diagnostic, prognostic or therapeutic use or 
any combination thereof. If no labeled binding partner to the antibody is provided, the anti- 
BPI antibody itself can be labeled with a detectable marker, e.g a chemiluminescent, 
enzymatic, fluorescent, or radioactive moiety. 

The invention also provides a kit comprising a nucleic acid probe capable of 
hybridizing to RNA encoding a BPI. In a specific embodiment, a kit comprises in one or 
more containers a pair of primers (e.g., each in the size range of 6-30 nucleotides, more 
preferably 10-30 nucleotides and still more preferably 10-20 nucleotides) that under 
appropriate reaction conditions can prime amplification of at least a portion of a nucleic acid 
encoding a BPI, such as by polymerase chain reaction (see, e.g., Innis et al., 1990, PCR 
Protocols, Academic Press, Inc., San Diego, CA), ligase chain reaction (see EP 320,308) use 
of QP replicase, cyclic probe reaction, or other methods known in the art. 

Kits are also provided which allow for the detection of a plurality of BPIs or a 
plurality of nucleic acids each encoding a BPI. A kit can optionally further comprise a 
predetermined amount of an isolated BPI protein or a nucleic acid encoding a BPI, e.g., for 
use as a standard or control. 

5.3 Statistical T echniques for Identifying BPIs and BPI Clusters 

The uru-variate differential analysis tools, such as fold changes, wilcoxon rank- sum 
test and t-test, are useful in identifying individual BFs or BPIs that are diagnostically 
associated with breast cancer or in identifying individual BPIs that regulate the disease 
process. In most cases, however, those skilled in the art appreciate that the disease process 
is associated with a combination of BFs or BPIs (and to be regulated by a combination of 
BPIs), rather than individual BFs and BPIs in isolation. The strategies for discovering such 
combinations of BFs and BPIs differ from those for discovering individual BFs and BPIs. 

In such cases, each individual BF and BPI can be regarded as one variable and the disease 
can be regarded as a joint, multi-variate effect caused by interaction of these variables. 

The following steps can be used to identify markers from data produced by the 
Preferred Technology. 
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The first step is to identify a collection of BFs or BPIs that individually show 
significant association with breast cancer. The association between the identified BFs or 
BPIs and breast cancer need not be as highly significant as is desirable when an individual 
BF or BPI is used as a diagnostic. Any of the tests discussed above (fold changes, wilcoxon 
5 rank-sum test, etc.) can be used at this stage. Once a suitable collection of BFs or BPIs has 
been identified, a sophisticated multi-variate analysis capable of identifying clusters can then 
be used to estimate the significant multivariate associations with breast cancer. 

Linear Discriminant Analysis (LDA) is one such procedure, which can be used to 
detect significant association between a cluster of variables (i.e., BFs or BPIs) and breast 
10 cancer. In performing LDA, a set of weights is associated with each variable (*'.&> BF or 

BPI) so that the linear combination of weights and the measured values of the variables can 
identify the disease state by discriminating between subjects having breast cancer and 
subjects free from breast cancer. Enhancements to the LDA allow stepwise inclusion (or 
removal) of variables to optimize the discriminant power of the model. The result of the 
1 5 LDA is therefore a cluster of BFs or BPIs which can be used, without limitation, for 

diagnosis, prognosis, therapy or drug development. Other enhanced variations of LDA, such 
as Flexible Discriminant Analysis permit the use of non-linear combinations of variables to 
discriminate a disease state from a normal state. The results of the discriminant analysis can 
be verified by post-hoc tests and also by repeating the analysis using alternative techniques 
20 such as classification trees. 

A further category of BFs or BPIs can be identified by qualitative measures by 
comparing the percentage feature presence of an BF or BPI of one group of samples (e.g., 
samples from diseased subjects) with the percentage feature presence of an BF or BPI in 
another group of samples (e.g., samples from control subjects). The "percentage feature 
25 presence" of an BF or BPI is the percentage of samples in a group of samples in which the 
BF or BPI is detectable by the detection method of choice. For example, if an BF is 
detectable in 95 percent of samples from diseased subjects, the percentage feature presence 
of that BF in that sample group is 95 percent. If only 5 percent of samples from non- 
diseased subjects have detectable levels of the same BF, detection of that BF in the sample 
30 of a subject would suggest that it is likely that the subject suffers from breast cancer. 
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5.4 Use in Clinical Studies 

The diagnostic methods and compositions of the present invention can assist in 
monitoring a clinical study, e.g. to evaluate drugs for therapy of breast cancer. In one 
embodiment, candidate molecules are tested for their ability to restore BF or BPI levels in a 
subject having breast cancer to levels found in subjects free from breast cancer or, in a 
treated subject, to preserve BF or BPI levels at or near non-breast cancer values. The levels 
of one or more BFs or BPIs can be assayed. 

In another embodiment, the methods and compositions of the present invention are 
used to screen candidates for a clinical study to identify individuals having breast cancer; 
such individuals can then be either excluded from or included in the study or can be placed 
in a separate cohort for treatment or analysis. If desired, the candidates can concurrently be 
screened to identify individuals with breast cancer; procedures for these screens are well 
known in the art. 

In another embodiment, the methods and compositions of the present invention are 
used to screen for individuals most likely to respond to treatment with a given breast cancer 
therapeutic agent (e.g. patients displaying a breast cancer antigen for which a specific 
antibody therapy has been developed) 

5.5 Purification of BPIs 

In particular aspects, the invention provides isolated mammalian BPIs, preferably 
human BPIs, and fragments thereof which comprise an antigenic determinant (i.e., can be 
recognized by an antibody) or which are otherwise functionally active, as well as nucleic 
acid sequences encoding the foregoing. "Functionally active" as used herein refers to 
material displaying one or more functional activities associated with a full-length (wild-type) 
BPI, e.g., binding to a BPI substrate or BPI binding partner, antigenicity (binding to an anti- 
BPI antibody), immunogenicity, enzymatic activity and the like. 

In specific embodiments, the invention provides fragments of a BPI comprising at 
least 5 amino acids, at least 10 amino acids, at least 50 amino acids, or at least 75 amino 
acids. Fragments lacking some or all of the regions of a BPI are also provided, as are 
proteins (e.g., fusion proteins) comprising such fragments. Nucleic acids encoding the 
foregoing are provided. 
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Once a recombinant nucleic acid which encodes the BPI, a portion of the BPI, or a 
precursor of the BPI is identified, the gene product can be analyzed. This is achieved by 
assays based on the physical or functional properties of the product, including radioactive 
labeling of the product followed by analysis by gel electrophoresis, immunoassay, etc. 

5 The BPIs identified herein can be isolated and purified by standard methods 

including chromatography (e.g., ion exchange, affinity, and sizing column chromatography), 
centrifugation, differential solubility, or by any other standard technique for the purification 
of proteins. 

Alternatively, once a recombinant nucleic acid that encodes the BPI is identified, the 
10 entire amino acid sequence of the BPI can be deduced from the nucleotide sequence of the 

gene coding region contained in the recombinant nucleic acid. As a result, the protein can be 
synthesized by standard chemical methods known in the art ( e.g ., see Hunkapiller et al., 

1984, Nature 310:105-111). 

In another alternative embodiment, native BPIs can be purified from natural sources, 

1 5 by standard methods such as those described above (e.g. , immunoaffinity purification). 

In a preferred embodiment, BPIs are isolated by the Preferred Technology described 
supra. For preparative-scale runs, a narrow-range "zoom gel" having a pH range of 2 pH 
units or less is preferred for the isoelectric step, according to the method descnbed in 
Westermeier, 1993, Electrophoresis in Practice (VCH, Weinheim, Germany), pp. 197-209 
20 (which is incorporated herein by reference in its entirety); this modification permits a larger 
quantity of a target protein to be loaded onto the gel, and thereby increases the quantity of 
isolated BPI that can be recovered from the gel. When used in this way for preparative-scale 
runs, the Preferred Technology typically provides up to 100 ng, and can provide up to 1000 
ng, of an isolated BPI in a single run. Those of skill in the art will appreciate that a zoom 
25 gel can be used in any separation strategy which employs gel isoelectnc focusing. 

The invention thus provides an isolated BPI, an isolated BPI-related polypeptide, and 
an isolated derivative or fragment of a BPI or a BPI-related polypeptide; any of the 
foregoing can be produced by recombinant DNA techniques or by chemical synthetic 

methods. 

30 

5.6 Isolation of DNA Encoding a BPI 
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Specific embodiments for the cloning of a gene encoding a BPI, are presented below 
by way of example and not of limitation. 

The nucleotide sequences of the present invention, including DNA and RNA, and 
comprising a sequence encoding a BPI or a fragment thereof, or a BPI-related polypeptide, 
may be synthesized using methods known in the art, such as using conventional chemical 
approaches or polymerase chain reaction (PCR) amplification. The nucleotide sequences of 
the present invention also permit the identification and cloning of the gene encoding a BPI 
homolog or BPI ortholog including, for example, by screening cDNA libraries, genomic 
libraries or expression libraries. 

For example, to clone a gene encoding a BPI by PCR techniques, anchored 
degenerate oligonucleotides (or a set of most likely oligonucleotides) can be designed for all 
BPI peptide fragments identified as part of the same protein. PCR reactions under a variety 
of conditions can be perfoimed with relevant cDNA and genomic DNAs (e.g., from brain 
tissue or from cells of the immune system) from one or more species. Also vectorette 
reactions can be performed on any available cDNA and genomic DNA using the 
oligonucleotides (which preferably are nested) as above. Vectorette PCR is a method that 
enables the amplification of specific DNA fragments in situations where the sequence of 
only one primer is known. Thus, it extends the application of PCR to stretches of DNA 
where the sequence information is only available at one end. (Arnold C, 1991, PCR 
Methods Appl. 1(1 ):39-42; Dyer KD, Biotechniques, 1995, 19(4):550-2). Vectorette PCR 
may pe performed with probes that are, for example, anchored degenerate oligonucleotides 
(or most likely oligonucleotides) coding for BPI peptide fragments, using as a template a 
genomic library or cDNA library pools. 

Anchored degenerate oligonucleotides (and most likely oligonucleotides) can be 
designed for all BPI peptide fragments. These oligonucleotides may be labelled and 
hybridized to filters containing cDNA and genomic DNA libraries. Oligonucleotides to 
different peptides from the same protein will often identify the same members of the library. 
The cDNA and genomic DNA libraries may be obtained from any suitable or desired 
mammalian species, for example from humans. 

Nucleotide sequences comprising a nucleotide sequence encoding a BPI or BPI 
fragment of the present invention are useful for their ability to hybridize selectively with 
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complementary stretches of genes encoding other proteins. Depending on the application, a 
variety of hybridization conditions may be employed to obtain nucleotide sequences at least 
30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 99% 
identical, or 100% identical, to the sequence of a nucleotide encoding a BPI. 

For a high degree of selectivity, relatively stringent conditions are used to form the 
duplexes, such as low salt or high temperature conditions. As used herein, "highly stringent 
conditions" means hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium 
dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.1xSSC/0.1% SDS at 68°C 
(Ausubel F.M. et al., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green 
Publishing Associates, Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3; 
incorporated herein by reference in its entirety.) For some applications, less stringent 
conditions for duplex formation are required. As used herein "moderately stringent 
conditions" means washing in 0.2xSSC/0.1% SDS at 42°C (Ausubel et al., 1989, supra). 
Hybridization conditions can also be rendered more stringent by the addition of increasing 
amounts of formamide, to destabilize the hybrid duplex. Thus, particular hybridization 
conditions can be readily manipulated, and will generally be chosen depending on the 
desired results. In general, convenient hybridization temperatures in the presence of 50% 
formamide are: 42°C for a probe which is 95 to 100% identical to the fragment of a gene 
encoding a BPI, 37°C for 90 to 95% identity and 32°C for 70 to 90% identity. 

In the preparation of genomic libraries, DNA fragments are generated, some of 
which will encode parts or the whole of a BPI. Any suitable method for preparing DNA 
fragments may be used in the present invention. For example, the DNA may be cleaved at 
specific sites using various restriction enzymes. Alternatively, one may use DNAse m the 
presence of manganese to fragment the DNA, or the DNA can be physically sheared, as for 
example, by sonication. The DNA fragments can then be separated according to size by 
standard techniques, including but not limited to agarose and polyacrylamide gel 
electrophoresis, column chromatography and sucrose gradient centrifugation. The DNA 
fragments can then be inserted into suitable vectors, including but not limited to plasmids, 
cosmids, bacteriophages lambda or T 4 , and yeast artificial chromosome (YAC). (See, e.g., 
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York; Glover, D.M. (ed.), 1985, DNA 
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Cloning: A Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II; Ausubel F.M. et 
ai., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green Publishing Associates, 
Inc., and John Wiley & sons, Inc., New York). The genomic library may be screened by 
nucleic acid hybridization to labeled probe (Benton and Davis, 1977, Science 196:180; 
Grunstein and Hogness, 1975. Proc. Natl. Acad. Sci. IJ.S.A 72:3961). 

Based on the present description, the genomic libraries may be screened with labeled 
degenerate oligonucleotide probes corresponding to the amino acid sequence of any peptide 
of the BPI using optimal approaches well known in the art. Any probe used is at least 1 0 
nucleotides, at least 15 nucleotides, at least 20 nucleotides, at least 25 nucleotides, at least 30 
nucleotides, at least 40 nucleotides, at least 50 nucleotides, at least 60 nucleotides, at least 70 
nucleotides, at least 80 nucleotides, or at least 1 00 nucleotides. Preferably a probe is 1 0 
nucleotides or longer, and more preferably 15 nucleotides or longer. 

As shown in Tables VI, VII, VIII and IX above, some BPIs disclosed herein 
correspond to isoforms of previously identified proteins encoded by genes whose sequences 
are publicly known. To screen such a gene, any probe may be used that is complementary to 
the gene or its complement; preferably the probe is 10 nucleotides or longer, more preferably 
15 nucleotides or longer. When no nucleotide sequence is known that encodes a given BPI, 
degenerate probes can be used for screening. In Table XII, a degenerate set of probes is 
provided for each of the following BPIs: BPI-41, BPI-42, BPI-43, BPI-44, BPI-45, BPI-46, 
BPI-47, BPI-48, BPI-49, BPI-50, BPI-51, BPI-52, BPI-53, BPI-54, BPI-55, BPI-56. In the 
method used for sequencing by mass spectroscopy in the present invention, the following 
sets of amino acids cannot be distinguished since they have the same mass: leucine (L) and 
isoleucine (I); asparagine (N) and two glycines (GG). Furthermore, the mass accuracy of the 
tandem mass spectrometer used for amino acid sequencing in the method of the present 
invention was insufficient to distinguish between the following sets of amino acids: 
phenylalanine (F) and oxidized methionine (M*); tryptophan (W) and the combination of 
aspartic acid and alanine (i.e. DA or AD); tryptophan (W) and the combination of glutamic 
acid (E) and glycine (G) (i.e. EG or GE); tryptophan (W) and the combination of valine (V) 
and serine (S) (i.e. VS or SV). In Table XU, each possible amino acid sequence is listed for 
each sequence determined by mass spectroscopy, and preferred and fully degenerate sets of 
probes for each possible amino acid sequence are provided. 
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Table XU. Amino Acid Sequences and Probes for BPIs 


BF # 

BPI# 

Partial Amino Acid Sequence as Determined by 
Mass Spectrometry 

Preferred Probes 

Degenerate Probes 

Mass of 
singly 
protonated 
peptide 0 

Core 

sequence 1 * 

N-terminal 

Mass* 

C-terminaJ 

Mass'* 

BF-47 



ECQ 

257.192 

506.225 

GAGTGCCAG 

GARTGYCAR 

BF-48 



CQATGFSPR 

226.16 

0 

T GCCAGGCCACC 

GGCTT CAGCCCC 

CGC 

TGYCARGCNACNG 

GNTTYWSNCCNM 

GN 

BF-48 

BPI-47 

1249.65 


226.16 

0 

T GCCAGGCCACC 

GGCAT GAGCCCC 

CGC 

TGYCARGCNACNG 

GNAT GWSNCCNM 

GN 

BF-40 

BPI-48 

1566.75 

DDF 

341.23 

848.378 

GACGACTTC 

GAYGAYTTY 

BF-40 

BPI-48 




848.378 

GACGACATG 

GAYGAYATG 

BF-40 





0 

CTGGAGTTCTTC 

CCCCGC 

YTNGARTTYTTYCC 

NMGN 

BF-40 





0 

AT CGAGTT CTT C 

CCCCGC 

ATHGARTTYTTYCC 

NMGN 


BPI-48 




0 

CTGGAGATGTTC 

CCCCGC 

YTNGARATGTTYC 

CNMGN 

BF-40 

BPI-48 



229.1 



YTNGARTTYAT GC 

CNMGN 

BF-40 

BPI-48 

1037.51 

LEMMPR 


■ 


YTNGARAGTAGTC 

CNMGN 

BF-40 

BPI-48 

1037.51 

IEMFPR 


■ 


ATHGARATGTTYC 

CNMGN 

BF-40 





■ 


ATHGARTTY ATGC 

CNMGN 

BF-40 

BPI-48 

1037.51 

IEMMPR 

229.1 

0 


ATHGARAT GAT GC 

CNMGN 

BF-22 

BPI-49 

1192.59 

AN 




GCNAAY 

BF-22 

BPI-49 

1192.59 

AGG 

BH 



GCNGGNGGN 

BF-43 

BPI-42 

1022.53 

VYQ 

271.166 

361.186 

GTGTACCAG 

GTNTAYCAR 

BF-43 



LLEN 

128.159 

402.223 

CT GCT GGAGAAC 

YTNYTNGARAAY 

BF-43 



LIEN 

128.159 

402.223 

CTGATCGAGAAC 

YTNATHGARAAY 

BF-43 



ILEN 

128.159 

402.223 

AT CCT GGAGAAC 

ATHYTNGARAAY 


41 
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BF-43 

BPI-42 

999.552 

IIEN 

128.159 

402.223 

AT CAT CG AGAAC 

ATHATHGARAAY 

BF-43 

BPI-42 

999.552 

LLEGG 

128.159 

402.223 

CT GCT GGAGGGC 

YTNYTNGARGGNG 







GGC 

GN 

igsm 


B] 


128.159 

402.223 

CT GAT CGAGGGC 

YTNATHGARGGNG 

w 


II 




GGC 

GN 

BF-43 

BPI-42 

999.552 

ILEGG 

128.159 

402.223 

AT CCT GGAGGGC 

ATHYTNGARGGNG 







GGC 

GN 

BF-43 

BPI-42 

999.552 

IIEGG 

128.159 

402.223 

AT CAT CGAGGGC 

ATH ATH G ARGGNG 







GGC 

GN 



1027.43 

PA 

269.064 

590.31 

CCCGCC 

CCNGCN 

n 




mum 


TGCTACTGCCAG 

TGYTAYTGYCARA 

IH 

■ 


1 

■ 


AAG 

AR 

BF-27 



rn 

328.22 

232.128 

CTGG ACGACT AC 

YTN GAY GAYT AYY 

Hi 



1 



CTGAAC 

TNAAY 


BPI-51 

1293.65 

LDDYIN 

328.22 

232.128 

CT GGACGACTAC 

YTNGAYGAYTAYA 

■ 






ATCAAC 

THAAY 


BPI-51 

1293.65 

IDDYLN 

328.22 


AT CG ACGACT AC 

ATHGAYGAYTAYY 

i 






CTGAAC 

TNAAY 

73 

BPI-51 

1293.65 

100 YIN 

328.22 

232.128 

AT CG ACGACT AC 

ATHGAYGAYTAYA 

1 






ATCAAC 

THAAY 


BPI-51 

1293.65 

LDDYLGG 

328.22 


CT GGACG ACT AC 


M 





H 

CTGGGCGGC 


:}&4M 

BPI-51 

1293.65 

LDDYIGG 

328.22 



MzsEsnsnl 

mm 






ATCGGCGGC 

THGGNGGN 

BF-27 

BPI-51 

1293.65 

IDDYLGG 

328.22 



a| 






pH 



BF-27 

BPI-51 

1293.65 

IDDYIGG 

328.22 




■I 

■1 

1H 




1 

|| 

m\ 

iSII 


HAQ 

333.174 

S63.395 

CACGCCCAG 


gjgj|| 

gggj 


3HI 



■I 

■i 

BF-27 

BPI-51 

1480.77 

El 

360.25 

378.412 

GAGATC 

GARATH 

BF-30 

BPI-52 

991.372 

FGPVPR 

318.937 

) 

TTCGGCCCCGTG 

TTYGGNCCNGTNC 







CCCCGC 

3NMGN 

BF-30 

BPI-52 

991.372 

MGPVPR 

318.937 

) 

^TGGGCCCCGTG 

VTGGGNCCNGTNC 







^CCCGC 

3NMGN 

BF-45 

3PI-44 

1042.47 

YCT 

297.13 

*21.17 

rACTGCACC 

rAYTGYACN 

BF-45 

3PI-44 

1210.65 

i/VEE 

121.153 

133.182 

3TGGTG G AGGAG 

3TNGTNGARGAR 

BF-14 

3PI-53 

182.61 

/VLGD 

7 

r 1 1 .46 

rGGCTGGGCGAC 

r GGYTNGGNGAY 

BF-14 

3PI-53 



7 

r 1 1.46 

3 ACGCCCT GGGC 

3AYGCNYTNGGNG 




II 



SAC > 

\Y 


42 
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BF-14 

BF-14 

BF-14 

BF-14 

BF-14 





16 


AGCGT GAT CGGC 

GAO 

WSNGTNATHGGN 

GAY 

CAGTGCGTGGTG 

GACTTCTTCCGC 

CART GYGTNGTNG 

AYTTYTTYMGN 

CAGTGCGTGGTG 

GACATGTTCCGC 

CARTGYGTNGTNG 

AYATGTTYMGN 

CAGTGCGTGGTG 

GACTT CAT GCGC 

CARTGYGTNGTNG 

AYTTYAT GMGN 



BPI-43 1213.65 DALQV 0 


687.255 




BF-44 


GCCGACCTGCAG 

GTG 


BF-44 

BPI-43 

1213.65 

EGLQV 

0 

687.255 


BF-44 

BPI-43 

1213.65 

GELQV 

0 

687.255 


BF-44 

BPI-43 

1213.65 

VSLQV 

0 

687.255 
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BF-4 

4 BPI-4 

3 1213.65 

SVLQV 

0 

687.255 

AGCGT GOT GCA 

GTG 

3 WSNGTNYTNCARG 
TN 

BF-4^ 

BPI-4; 

1213.65 

WIQV 

0 

687.255 

T GGAT CCAGGT C 

3 T GGATHCARGTN 

BF-4^ 

BPI-4C 

1213.65 

DAIQV 

0 

687.255 

GACGCCAT CCAC 
GTG 

3 GAY GCNATHC ARG 
TN 

II 


B 


■ 

u 

GCCGACAT CCAC 
GTG 

GCNGAYATHCARG 

TN 

II 


IfiiiS 


■ 

jgjj 

GAGGGCAT CCAC 

GTG 


BF-44 

BPI-43 

1213.65 


■ 


GGCGAGATCCAC 

GTG 


BF-44 

BPI-43 

1213.65 

VSIQV 

0 


|_ 


BF-44 

BPI-43 

1213.65 

SVIQV 

■ 

gm 



BF-44 

BPI-43 

1190.63 

YFV 

213.987 

567.385 


TAYTTYGTN 

BF-44 

BPI-43 

1190.63 

YMV 

213.987 

567.385 

TACATGGTG 

TAYATGGTN 

BF-44 

BPI-43 

1213.63 

WLQG 

0 

729.296 

T GGCT GCAGGGC 

TGGYTNCARGGN 

BF-44 

BPI-43 

1213.63 

DALQG 

■ 

729.296 

GACGCCCT GCAG 

GGC 

GAYGCNYTNCARG 

GN 

BF-44 

BPI-43 

1213.63 

ADLQG 

■ 

729.296 

GCCGACCT GCAG 
GGC 

GCNGAYYTNCARG 

GN 

BF-44 

BPI-43 

1213.63 

EGLQG 

■ 

729.296 

GAGGGCCT GCAG 

GGC 

GARGGNYTNCARG 

GN 

BF-44 

BPI-43 

1213.63 

GELQG 

■ 

729.296 

GGCG AGCT GCAG 
GGC 

GGNGARYTNCARG 

GN 

BF-44 

BPI-43 

1213.63 

VSLQG 


729.296 

GT GAGCCT GCAG 

GGC 

GTNWSNYTNCARG 

GN 

BF-44 

BPI-43 

1213.63 

SVLQG 


72 9.296 

<\GCGT GCT GCAG 

GGC 

WSNGTNYTNCARG 

GN 

BF-44 

3PI-43 

213.63 


) 

729.296 

rGGATCCAGGGC 

r GGATHCARGGN 

BF-44 1 

BPI-43 1 

1213.63 [ 


■i 

■E 


GAYGCNATHCARG 

3N 

BF-44 E 

3PI-43 1 

1213.63 / 

\0\QG C 

7 

29.296 C 

C 

3CCGACAT CCAG C 
3GC C 

3CNGAYATHCARG 

3N 

BF-44 E 

iPI-43 1 

213.63 E 

:giqg o 

7 

29.296 C 

C 

5AGGGCAT CCAG C 
5GC C 

3ARGGNATHCARG 

SN 

BF-44 E 

IPI-43 1 

213.63 G 

5EIQG 0 

7 

29.296 C 

C 

JGCGAGAT CCAG C 
IGC G 

5GNGARATHCARG 

IN 

BF-44 B 

PI-43 1; 

213.63 V 

SIQG 0 

7 

29.296 G 

G 

T GAGCAT CCAG G 
GC G 

ITNWSNATHCARG 

IN 

BF-44 B 

PI-43 i: 

213.63 S 

VIQG 0 

7 

29.296 A 

GCGTGATCCAG V\ 

/SNGTNATHCARG 


44 
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5 


10 




BF-34 BPI-56 1712.79 EN 292.2 1177.59 GAGAAC GARAAY 

BF-34 BP I -56 1712.79 EGG 292.2 1177.59 GAGGGCGGC GARGGNGGN 

a 777 /$ corresponds to the mass of the neutral peptide (M) with the addition of a single proton (H+) 


bjhe ‘ core sequence’ is a partial amino acid sequence of a peptide eludicated from the interpretaVon of the 


fragment mass spectrum of the peptide. 

c The N-terminal mass of the peptide is the mass between the start of the core sequence and the N-terminus of 
the peptide. This is a neutral mass corresponding to the addition of the constituent amino acid residues 
extending from the N-terminus of the peptide to the core sequence. (In the context of the present description, 
an amino acid residue refers to an amino acid residue of general structure: -NH-CHR-CO-) 
djhe C-terminai mass is the mass between the end of die core sequence and the C-terminus of the peptide. 
This mass corresponds to the addiition of the constitiuent amino acid residues extending from the end of the 
core sequence to the C-terminus of the peptide with the addition of a water molecule (Htf), and a single proton 
(H+). (In the context of the present description , an amino acid residue refers to an amino acid residue of 
general structure: -NH-CHR-CO-) 


In Table XII, supra , the preferred and degenerate sets of probes are described using 
GCG Nucleotide Ambiguity Codes as employed in GCG SeqWeb™ sequence analysis 
software (SeqWeb™ version 1.1, part of Wisconsin Package Version 1 0, Genetics Computer 
Group, Inc.) . These Nucleotide Ambiguity Codes have the following meaning: 
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GCG Code 

Meaning 

A 

A 

C 

C 

G 

G 

T 

T 

U 

T 

M 

A or C 

R 

A or G 

W 

AorT 

S 

C or G 

Y 

CorT 

K 

GorT 

V 

A or C or G 

H 

A or C or T 

D 

A or G or T 

B 

C or G or T 

X 

G or A or T or C 

N 

G or A or T or C 


GCG uses the letter codes for amino acid codes and nucleotide ambiguity proposed 
by IUPAC-IUB. These codes are compatible with the codes used by the EMBL, GenBank, 
and PIR databases . See IUPAC, Commission on Nomenclature of Organic Chemistry. A 
Guide to IUPAC Nomenclature of Organic Compounds (Recommendations 1 993), 
Blackwell Scientific publications, 1993. 

When a library is screened, clones with insert DNA encoding the BPI or a fragment 
thereof will hybridize to one or more members of the corresponding set of degenerate 
oligonucleotide probes (or their complement). Hybridization of such oligonucleotide probes 
to genomic libraries is carried out using methods known in the art. For example, 
hybridization with one of the above-mentioned degenerate sets of oligonucleotide probes, or 
their complement (or with any member of such a set, or its complement) can be performed 
under highly stringent or moderately stringent conditions as defined above, or can be earned 
out in 2X SSC, 1.0% SDS at 50°C and washed using the washing conditions described supra 
for highly stringent or moderately stringent hybridization. 

In yet another aspect of the invention, clones containing nucleotide sequences 
encoding the entire BPI, a fragment of a BPI, a BPI-related polypeptide, or a fragment of a 
BPI-related polypeptide any of the foregoing may also be obtained by screening expression 
libraries. For example, DNA from the relevant source is isolated and random fragments are 
prepared and ligated into an expression vector (e.g., a bacteriophage, plasmid, phagemid or 
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cosmid) such that the inserted sequence in the vector is capable of being expressed by the 
host cell into which the vector is then introduced. Various screening assays can then be used 
to select for the expressed BPI or BPI-related polypeptides. In one embodiment, the various 
anti-BPI antibodies of the invention can be used to identify the desired clones using methods 
5 known in the art. See, for example, Harlow and Lane, 1 988, Antibodies : A Laboratory 
Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, Appendix IV. 
Colonies or plaques from the library are brought into contact with the antibodies to identify 
those clones that bind antibody. 

In an embodiment, colonies or plaques containing DNA that encodes a BPI, a 
10 fragment of a BPI, a BPI-related polypeptide, or a fragment of a BPI-related polypeptide can 
be detected using DYNA Beads according to Olsvick et al., 29th ICAAC, Houston, Tex. 
1989, incorporated herein by reference. Anti-BPI antibodies are crosslinked to tosylated 
DYNA Beads M280, and these antibody-containing beads are then contacted with colonies 
or plaques expressing recombinant polypeptides. Colonies or plaques expressing a BPI or 
1 5 BPI-related polypeptide are identified as any of those that bind the beads. 

Alternatively, the anti-BPI antibodies can be nonspecifically immobilized to a 
suitable support, such as silica or Celite® resin. This material is then used to adsorb to 
bacterial colonies expressing the BPI protein or BPI-related polypeptide as described herein. 

In another aspect, PCR amplification may be used to isolate from genomic DNA a 
20 substantially pure DNA (z. e . , a DNA substantially free of contaminating nucleic acids) 

encoding the entire BPI or a part thereof. Preferably such a DNA is at least 95% pure, more 
preferably at least 99% pure. Oligonucleotide sequences, degenerate or otherwise, that 
correspond to peptide sequences of BPIs disclosed herein can be used as primers. 

PCR can be carried out, e.g by use of a Perkin-Elmer Cetus thermal cycler and Taq 
25 polymerase (Gene Amp® or AmpliTaq DNA polymerase). One can choose to synthesize 

several different degenerate primers, for use in the PCR reactions. It is also possible to vary 
the stringency of hybridization conditions used in priming the PCR reactions, to allow for 
greater or lesser degrees of nucleotide sequence similarity between the degenerate primers 
and the corresponding sequences in the DNA. After successful amplification of a segment 
30 of the sequence encoding a BPI, that segment may be molecularly cloned and sequenced, 
and utilized as a probe to isolate a complete genomic clone. This, in turn, will permit the 
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determination of the gene's complete nucleotide sequence, the analysis of its expression, and 
the production of its protein product for functional analysis, as described infra . 

The gene encoding a BPI can also be identified by mRNA selection by nucleic acid 
hybridization followed by in vitro translation. In this procedure, fragments are used to 
isolate complementary mRNAs by hybridization. Such DNA fragments may represent 
available, purified DNA encoding a BPI of another species ( e.g ., mouse, human). 
Immunoprecipitation analysis or functional assays (e.g., aggregation ability in vitro ; binding 
to receptor) of the in vitro translation products of the isolated products of the isolated 
mRNAs identifies the mRNA and, therefore, the complementary DNA fragments that 
contain the desired sequences. In addition, specific mRNAs may be selected by adsoiption 
of polysomes isolated from cells to immobilized antibodies that specifically recognize a BPI. 
A radiolabelled cDNA encoding a BPI can be synthesized using the selected mRNA (from 
the adsorbed polysomes) as a template. The radiolabelled mRNA or cDNA may then be 
used as a probe to identify the DNA fragments encoding a BPI from among other genomic 
DNA fragments. 

Alternatives to isolating genomic DNA encoding a BPI include, but are not limited 
to, chemically synthesizing the gene sequence itself from a known sequence or making 
cDNA to the mRNA which encodes the BPI. For example, RNA for cDNA cloning of the 
gene encoding a BPI can be isolated from cells which express the BPI. Those skilled in the 
art will understand from the present description that other methods may be used and are 
within the scope of the invention. 

Any suitable eukaryotic cell can serve as the nucleic acid source for the molecular 
cloning of the gene encoding a BPI. The nucleic acid sequences encoding the BPI can be 
isolated from vertebrate, mammalian, primate, human, porcine, bovine, feline, avian, equine, 
canine or murine sources. The DNA may be obtained by standard procedures known in the 
art from cloned DNA (e.g., a DNA "libraiy"), by chemical synthesis, by cDNA cloning, or 
by the cloning of genomic DNA, or fragments thereof, purified from the desired cell. (See, 
e.g., Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York; Glover, D.M. (ed.), 1985, DNA 
Cloning: A Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, U.) Clones derived 
from genomic DNA may contain regulatory and intron DNA regions in addition to coding 
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regions; clones derived from cDNA will contain only exon sequences. 

The identified and isolated gene or cDNA can then be inserted into any suitable 
cloning vector. A large number of vector-host systems known in the art may be used. As 
those skilled in the art will appreciate, the only limitation is that the vector system chosen be 
5 compatible with the host cell used. Such vectors include, but are not limited to, 

bacteriophages such as lambda derivatives, plasmids such as PBR322 or pUC plasmid 
derivatives or the Bluescript vector (Stratagene) or modified viruses such as adenoviruses, 
adeno-associated viruses or retroviruses. The insertion into a cloning vector can be 
accomplished, for example, by ligating the DNA fragment into a cloning vector which has 
10 complementary cohesive termini. However, if the complementary restriction sites used to 

fragment the DNA are not present in the cloning vector, the ends of the DNA molecules may 
be enzymatically modified. Alternatively, any site desired may be produced by ligating 
nucleotide sequences (linkers) onto the DNA termini; these ligated linkers may comprise 
specific chemically synthesized oligonucleotides encoding restriction endonuclease 
15 recognition sequences. In an alternative method, the cleaved vector and the gene encoding a 
BPI may be modified by homopolymeric tailing. Recombinant molecules can be introduced 
into host cells via transformation, transfection, infection, electroporation, etc., so that many 
copies of the gene sequence are generated. 

In specific embodiments, transformation of host cells with recombinant DNA 
20 molecules that incorporate the isolated gene encoding the BPI, cDNA, or synthesized DNA 
sequence enables generation of multiple copies of the gene. Thus, the gene may be obtained 
in large quantities by growing transformants, isolating the recombinant DNA molecules 
from the transformants and, when necessary, retrieving the inserted gene from the isolated 
recombinant DNA. 

25 The nucleotide sequences of the present invention include nucleotide sequences 

encoding amino acid sequences with substantially the same amino acid sequences as native 
BPIs, nucleotide sequences encoding amino acid sequences with functionally equivalent 
amino acids, nucleotide sequences encoding BPIs, a fragments of BPIs, BPI-related 
polypeptides, or fragments of BPI-related polypeptides. 

30 In a specific embodiment, an isolated nucleic acid molecule encoding a BPI-related 

polypeptide can be created by introducing one or more nucleotide substitutions, additions or 
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deletions into the nucleotide sequence of a BPI such that one or more amino acid 
substitutions, additions or deletions are introduced into the encoded protein. Standard 
techniques known to those of skill in the art can be used to introduce mutations, including, 
for example, site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, 
conservative amino acid substitutions are made at one or more predicted non-essential amino 
acid residues. A "conservative amino acid substitution" is one in which the amino acid 
residue is replaced with an amino acid residue having a side chain with a similar charge. 
Families of amino acid residues having side chains with similar charges have been defined 
in the art. These families include amino acids with basic side chains {e.g., lysine, arginine, 
histidine), acidic side chains ( e.g . , aspartic acid, glutamic acid), uncharged polar side chains 
(*•£•» glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side 
chains {e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tiyptophan), beta-branched side chains {e.g . , threonine, valine, isoleucine) and aromatic side 
chains {e.g., tyrosine, phenylalanine, tiyptophan, histidine). Alternatively, mutations can be 
introduced randomly along all or part of the coding sequence, such as by saturation 
mutagenesis, and the resultant mutants can be screened for biological activity to identify 
mutants that retain activity. Following mutagenesis, the encoded protein can be expressed 
and the activity of the protein can be determined. 

5.7* Expression of DNA Encoding BPTs 
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The nucleotide sequence coding for a BPI, a BPI analog, a BPI-related peptide, or a 
fragment or other derivative of any of the foregoing, can be inserted into an appropriate 
expression vector, i.e., a vector which contains the necessary elements for the transcription 
and translation of the inserted protein-coding sequence. The necessaiy transcriptional and 
5 translational signals can also be supplied by the native gene encoding the BPI or its flanking 
regions, or the native gene encoding the BPI-related polypeptide or its flanking regions. A 
variety of host-vector systems may be utilized in the present invention to express the 
protein-coding sequence. These include but are not limited to mammalian cell systems 
infected with virus (e.g. t vaccinia virus, adenovirus, etc.); insect cell systems infected with 
1 0 virus (eg. , baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria 
transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending on the host-vector 
system utilized, any one of a number of suitable transcription and translation elements may 
be used. In specific embodiments, a nucleotide sequence encoding a human gene (or a 
15 nucleotide sequence encoding a functionally active portion of a huma BPI) is expressed. In 
yet another embodiment, a fragment of a BPI comprising a domain of the BPI is expressed. 

Any of the methods previously described for the insertion of DNA fragments 
into a vector may be used to construct expression vectors containing a chimeric gene 
consisting of appropriate transcriptional and translational control signals and the protein 
20 coding sequences. These methods may include in vitro recombinant DNA and synthetic 
techniques and in vivo recombinants (genetic recombination). Expression of nucleic acid 
sequence encoding a BPI or fragment thereof may be regulated by a second nucleic acid 
sequence so that the BPI or fragment is expressed in a host transformed with the 
recombinant DNA molecule. For example, expression of a BPI may be controlled by any 
25 promoter or enhancer element known in the art. Promoters which may be used to control the 
expression of the gene encoding a BPI or a BPI-related polypeptide include, but are not 
limited to, the SV40 early promoter region (Bemoist and Chambon, 1981, Nature 290:304- 
310), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto, et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter (Wagner 
30 et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the 
metallothionein gene (Brinster et al., 1982, Nature 296:39-42), the tetracycline (Tet) 
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promoter (Gossen et al., 1995, Proc. Nat. Acad. Sci. USA 89:5547-5551); prokaiyotic 
expression vectors such as the p-lactamase promoter (Villa-Kamaroff, et al., 1978, Proc. 
Natl. Acad. Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer, et al., 1983, Proc. Natl. 
Acad. Sci. U.S.A. 80:21-25; see also "Useful proteins from recombinant bacteria" in 


5 


10 


15 


20 


25 


30 


Scientific American, 1980, 242:74-94); plant expression vectors comprising the nopaline 
synthetase promoter region (Herrera-Estrella et al., Nature 303:209-213) or the cauliflower 
mosaic virus 35S RNA promoter (Gardner, et al., 1981, Nucl. Acids Res. 9:2871), and the 
promoter of the photosynthetic enzyme ribulose biphosphate carboxylase (Herrera-Estrella 
et al., 1984, Nature 310:1 15-120); promoter elements from yeast or other fungi such as the 
Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter, and the following animal transcriptional control 
regions, which exhibit tissue specificity and have been utilized in transgenic animals: 
elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984, 
Cell 38:639-64 6; Omitz et al., 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987, Hepatology 7:425-515); insulin gene control region which is active in 
pancreatic beta cells (Hanahan, 1985, Nature 315:1 15-122), immunoglobulin gene control 
region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames 
et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), 
mouse mammary tumor virus control region which is active in testicular, breast, lymphoid 
and mast cells (Leder et al., 1986, Cell 45:485-495), albumin gene control region which is 
active in liver (Pinkert et al., 1987, Genes and Devel. 1 :268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; 
Hammer et al., 1987, Science 235:53-58; alpha 1 -antitrypsin gene control region which is 
active in the liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene 
control region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; 
Kollias et al., 1986, Cell 46:89-94; myelin basic protein gene control region which is active 
in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 48:703-712); myosin light 
chain-2 gene control region which is active in skeletal muscle (Sani, 1985, Nature 314:283- 
286); neuronal -specific enolase (NSE) which is active in neuronal cells (Morelli et al., 1999, 
Gen. Virol. 80:571-83); brain-derived neurotrophic factor (BDNF) gene control region 
which is active in neuronal cells (Tabuchi et al., 1998, Biochem. Biophysic. Res. Com. 
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253:818-823); glial fibrillary acidic protein (GFAP) promoter which is active in astrocytes 
(Gomes et al., 1999, Braz J Med Biol Res 32(5):6 19-631; Morelli et aL, 1999, Gen. Virol. 
80:571-83) and gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et al., 1986, Science 234:1372-1378). 

5 In a specific embodiment, a vector is used that comprises a promoter operably linked 

to a BPI-encoding nucleic acid, one or more origins of replication, and, optionally, one or 
more selectable markers (e.g., an antibiotic resistance gene). 

In a specific embodiment, an expression construct is made by subcloning a BPI or a 
BPI-related polypeptide coding sequence into the EcoRI restriction site of each of the three 
10 pGEX vectors (Glutathione S-Transferase expression vectors; Smith and Johnson, 1988, 

Gene 7:3 1-40). This allows for the expression of the BPI product or BPI-related polypeptide 
from the subclone in the correct reading frame. 

In mammalian host cells, a number of viral-based expression systems may be 
utilized. In cases where an adenovirus is used as an expression vector, the BPI coding 
1 5 sequence or BPI-related polypeptide coding sequence may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro or 
in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., region 
El or E3) will result in a recombinant virus that is viable and capable of expressing the 
20 antibody molecule in infected hosts, (e g., see Logan & Shenk, 1984, Proc. Natl. Acad. Sci. 
USA 81 :355-359). Specific initiation signals may also be required for efficient translation 
of inserted antibody coding sequences. These signals include the ATG initiation codon and 
adjacent sequences. Furthermore, the initiation codon must be in phase with the reading 
frame of the desired coding sequence to ensure translation of the entire insert. These 
25 exogenous translational control signals and initiation codons can be of a variety of origins, 
both natural and synthetic. The efficiency of expression may be enhanced by the inclusion 
of appropriate transcription enhancer elements, transcription terminators, etc. (see Bittner et 
al., 1987 , Methods in Enzymol. 153:51-544). 

Expression vectors containing inserts of a gene encoding a BP I or a BPI-related 
30 polypeptide can be identified by three general approaches: (a) nucleic acid hybridization, (b) 
presence or absence of ’’marker" gene functions, and (c) expression of inserted sequences. In 
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the first approach, the presence of a gene encoding a BPI inserted in an expression vector 
can be detected by nucleic acid hybridization using probes comprising sequences that are 
homologous to an inserted gene encoding a BPI. In the second approach, the recombinant 
vector/host system can be identified and selected based upon the presence or absence of 
certain "marker" gene functions ( e.g ., thymidine kinase activity, resistance to antibiotics, 
transformation phenotype, occlusion body formation in baculovirus, etc.) caused by the 
insertion of a gene encoding a BPI in the vector. For example, if the gene encoding the BPI 
is inserted within the marker gene sequence of the vector, recombinants containing the gene 
encoding the BPI insert can be identified by the absence of the marker gene function. In the 
third approach, recombinant expression vectors can be identified by assaying the gene 
product (i.c., BPI) expressed by the recombinant. Such assays can be based, for example, on 
the physical or functional properties of the BPI in in vitro assay systems, e.g., binding with 
anti-BPI antibody. 

In addition, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion 
desired. Expression from certain promoters can be elevated in the presence of certain 
inducers; thus, expression of the genetically engineered BPI or BPI-related polypeptide may 
be controlled. Furthermore, different host cells have characteristic and specific mechanisms 
for the translational and post-translational processing and modification ( e.g . , glycosylation, 
phosphorylation of proteins). Appropriate cell lines or host systems can be chosen to ensure 
the desired modification and processing of the foreign protein expressed. For example, 
expression in a bacterial system will produce an unglycosylated product and expression in 
yeast will produce a glycosylated product. Eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, glycosylation, and 
phosphorylation of the gene product may be used. Such mammalian host cells include but 
are not limited to CHO, VERY, BHK, Hela, COS, MDCK, 293, 3T3, WI38, and in 
particular, neuronal cell lines such as, for example, SK-N-AS, SK-N-FI, SK-N-DZ human 
neuroblastomas (Sugimoto et al., 1984, J. Natl. Cancer Inst. 73: 51-57), SK-N-SH human 
neuroblastoma (Biochim. Biophys. Acta, 1982, 704: 450-460), Daoy human cerebellar 
medulloblastoma (He et al., 1992, Cancer Res. 52: 1144-1148) DBTRG-05MG glioblastoma 
cells (Kruse et ah, 1992, In Vitro Cell. Dev. Biol. 28A: 609-614), IMR-32 human 
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neuroblastoma (Cancer Res., 1970, 30: 2110-2118), 132 INI human astrocytoma (Proc. Natl 
Acad. Sci. USA ,1977, 74: 4816), MOG-G-CCM human astrocytoma (Br. J. Cancer, 1984, 
49: 269), U87MG human glioblastoma-astrocytoma (Acta Pathol. Microbiol. Scand., 1968, 
74 : 465-486), A172 human glioblastoma (Olopade et al., 1992, Cancer Res. 52: 2523-2529), 
5 C6 rat glioma cells (Benda et al., 1968, Science 161: 370-371), Neuro-2a mouse 
neuroblastoma (Proc. Natl. Acad. Sci. USA, 1970, 65: 129-136), NB41 A3 mouse 
neuroblastoma (Proc. Natl. Acad. Sci. USA, 1962, 48: 1184-1 190), SCP sheep choroid 
plexus (Bolin et al., 1994, J. Virol. Methods 48: 21 1-221), G355-5, PG-4 Cat normal 
astrocyte (Haapala et al., 1985, J. Virol. 53: 827-833), Mpf ferret brain (Trowbridge et al., 

10 1982, In Vitro 18: 952-960), and normal cell lines such as, for example, CTX TNA2 rat 

normal cortex brain (Radany et al., 1992, Proc. Natl. Acad. Sci. USA 89: 6467-6471) such 
as, for example, CRL7030 and Hs578Bst. Furthermore, different vector/host expression 
systems may effect processing reactions to different extents. 

For long-term, high-yield production of recombinant proteins, stable expression is 
15 preferred. For example, cell lines which stably express the differentially expressed or 
pathway gene protein may be engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells can be transformed with DNA controlled by 
appropriate expression control elements (e.g . , promoter, enhancer, sequences, transcription 
terminators, polyadenylation sites, etc.), and a selectable marker. Following the introduction 
20 of the foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched 
medium, and then are switched to a selective medium. The selectable marker in the 
recombinant plasmid confers resistance to the selection and allows cells to stably integrate 
the plasmid into their chromosomes and grow to form foci which in turn can be cloned and 
expanded into cell lines. This method may advantageously be used to engineer cell lines 
25 which express the differentially expressed or pathway gene protein. Such engineered cell 
lines may be particularly useful in screening and evaluation of compounds that affect the 
endogenous activity of the differentially expressed or pathway gene protein. 

A number of selection systems may be used, including but not limited to the herpes 
simplex virus thymidine kinase (Wigler, et al., 1977, Cell 1 1:223), hypoxanthine-guanine 
30 phosphoribosyltransferase (Szybalska & Szybalski, 1 962, Proc. Natl. Acad. Sci. USA 

48:2026), and adenine phosphoribosyltransferase (Lowy, et al., 1980, Cell 22:817) genes can 
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be employed in tk , hgprt or aprt cells, respectively. Also, antimetabolite resistance can be 
used as the basis of selection for dhfr, which confers resistance to methotrexate (Wigler, et 
al-» 1980 > Nati * Acad - Sci. USA 77:3567; O'Hare, et al, 1981, Proc. Natl. Acad. Sci. USA 
78:1527); gpt, which confers resistance to mycophenolic acid (Mulligan & Berg, 1981, Proc. 
Natl. Acad. Sci. USA 78:2072); neo, which confers resistance to the aminoglycoside G-418 
(Colberre-GarBPIn, et al., 1981, J. Mol. Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et aL, 1 984, Gene 30: 1 47) genes. 

In other specific embodiments, the BPI, fragment, analog, or derivative may be 
expressed as a fusion, or chimeric protein product (comprising the protein, fragment, analog, 
or derivative joined via a peptide bond to a heterologous protein sequence). For example, 
the polypeptides of the present invention may be fused with the constant domain of 
immunoglobulins (IgA, IgE, IgG, IgM), or portions thereof (CHI, CH2, CH3, or any 
combination thereof and portions thereof) resulting in chimeric polypeptides. Such fusion 
proteins may facilitate purification, increase half-life in vivo , and enhance the delivery of an 
antigen across an epithelial barrier to the immune system. An increase in the half-life in vivo 
and facilitated purification has been shown for chimeric proteins consisting of the first two 
domains of the human CD4-polypeptide and various domains of the constant regions of the 
heavy or light chains of mammalian immunoglobulins. See, e.g., EP 394,827; Traunecker et 
a 1 * Nature^ 33 1 :84-86 (1988). Enhanced deliveiy of an antigen across the epithelial barrier 
to the immune system has been demonstrated for antigens (e.g., insulin) conjugated to an 
FcRn binding partner such as IgG or Fc fragments (see, e.g., PCT publications WO 
96/22024 and WO 99/04813). 

Nucleic acids encoding a BPI, a fragment of a BPI, a BPI-related polypeptide, or a 
fragment of a BPI-related polypeptide can fused to an epitope tag (e.g., the hemagglutinin 
( HA ) tag or flag tag) to aid in detection and purification of the expressed polypeptide. For 
example, a system described by Janknecht et al. allows for the ready purification of 
non-denatured fusion proteins expressed in human cell lines (Janknecht et al., 1991, Proc. 

Natl. Acad. Sci. USA 88:8972-897). 

Fusion proteins can be made by ligating the appropriate nucleic acid sequences 
encoding the desired amino acid sequences to each other by methods known in the art, in the 
proper coding frame, and expressing the chimeric product by methods commonly known in 
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the art. Alternatively, a fusion protein may be made by protein synthetic techniques, e.g ., by 
use of a peptide synthesizer. 

Both cDNA and genomic sequences can be cloned and expressed. 

5 5.8 Domain Structure of BPIs 

Domains of some BPIs are known in the art and have been described in the scientific 
literature. Moreover, domains of a BPI can be identified using techniques known to those of 
skill in the art. For example, one or more domains of a BPI can be identified by using one or 
more of the following programs: ProDom, TMpred, and SAPS. ProDom compares the 
1 0 amino acid sequence of a polypeptide to a database of compiled domains (see, e.g., 

http://www.toulouse.inra.fr/prodom.html: Corpet F., Gouzy J. & Kahn D., 1999, Nucleic 
Acids Res., 27:263-267). TMpred predicts membrane-spanning regions of a polypeptide 
and their orientation. This program uses an algorithm that is based on the statistical analysis 
of TMbase, a database of naturally occuring transmembrane proteins (see, e.g., 

15 http://www.ch.embnet.org/software/TMPRFD form.html ; Hofmann & StofFel. (1993) 
“TMbase - A database of membrane spanning proteins segments.” Biol. Chem. 

Hoppe-Seyler 347,166). The SAPS program analyzes polypeptides for statistically 
significant features like charge-clusters, repeats, hydrophobic regions, compositional 
domains (see, e.g., Brendel et al., 1992, Proc. Natl. Acad. Sci. USA 89: 2002-2006). Thus, 
20 based on the present description, the skilled artisan can identify domains of a BPI having 
enzymatic or binding activity, and further can identify nucleotide sequences encoding such 
domains. These nucleotide sequences can then be used for recombinant expression of a BPI 
fragment that retains the enzymatic or binding activity of the BPI. 

Based on the present description, the skilled artisan can identify domains of a BPI 
25 having enzymatic or binding activity, and further can identify nucleotide sequences 

encoding such domains. These nucleotide sequences can then be used for recombinant 
expression of BPI fragments that retain the enzymatic or binding activity of the BPI. 

In one embodiment, a BPI has an amino acid sequence sufficiently similar to an 
identified domain of a known polypeptide. As used herein, the term "sufficiently . similar" 

30 refers to a first amino acid or nucleotide sequence which contains a sufficient number of 

identical or equivalent (e.g., with a similar side chain) amino acid residues or nucleotides to 


57 



WO 01/13117 


PCT/GB00/03143 


a second amino acid or nucleotide sequence such that the first and second amino acid or 
nucleotide sequences have or encode a common structural domain or common functional 
activity or both. 

A BPI domain can be assessed for its function using techniques well known to those 
of skill in the art. For example, a domain can be assessed for its kinase activity or for its 
ability to bind to DNA using techniques known to the skilled artisan. Kinase activity can be 
assessed, for example, by measuring the ability of a polypeptide to phosphoiylate a 
substrate. DNA binding activity can be assessed, for example, by measuring the ability of a 
polypeptide to bind to a DNA binding element in a electromobility shift assay. 

5.9 Production of Antibodies to BPIs 

According to the invention a BPI, BPI analog, BPI-related protein or a fragment or 
derivative of any of the foregoing may be used as an immunogen to generate antibodies 
which immunospecifically bind such an immunogen. Such immunogens can be isolated by 
any convenient means, including the methods described above. Antibodies of the invention 
include, but are not limited to polyclonal, monoclonal, bispecific, humanized or chimeric 
antibodies, single chain antibodies. Fab fragments and F(ab’) fragments, fragments produced 
by a Fab expression library, anti-idiotypic (anti-id) antibodies, and epitope-binding 
fragments of any of the above. The term "antibody" as used herein refers to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen binding site that specifically binds an 
antigen. The immunoglobulin molecules of the invention can be of any class (eg., IgG, IgE, 
IgM, IgD and IgA ) or subclass of immunoglobulin molecule. 

In one embodiment, antibodies that recognize gene products of genes encoding BPIs 
are publicly available. For example, antibodies that recognize these BPIs and/or their 
isoforms include antibodies recognizing BPI-5, BPI-10, BPI-1 1, BPI- 13, BPI-21, BPI-23, 
BPI-24, BPI-25, BPI-27, BPI-29, BPI-32, BPI-33, BPI-34 which antibodies can be 
purchased from commercial sources as shown in Table X above. In another embodiment, 
methods known to those skilled in the art are used to produce antibodies that recognize a 
BPI, a BPI analog, a BPI-related polypeptide, or a derivative or fragment of any of the 
foregoing. 
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In one embodiment of the invention, antibodies to a specific domain of a BPI are 
produced. In a specific embodiment, hydrophilic fragments of a BPI are used as 
immunogens for antibody production. 

In the production of antibodies, screening for the desired antibody can be 
5 accomplished by techniques known in the art, e.g. ELISA (enzyme-linked immunosorbent 
assay). For example, to select antibodies which recognize a specific domain of a BPI, one 
may assay generated hybridomas for a product which binds to a BPI fragment containing 
such domain. For selection of an antibody that specifically binds a first BPI homolog but 
which does not specifically bind to (or binds less avidly to) a second BPI homolog, one can 
10 select on the basis of positive binding to the first BPI homolog and a lack of binding to (or 
reduced binding to) the second BPI homolog. Similarly, for selection of an antibody that 
specifically binds a BPI but which does not specifically bind to (or binds less avidly to) a 
different isoform of the same protein (such as a different glycoform having the same core 
peptide as the BPI), one can select on the basis of positive binding to the BPI and a lack of 
1 5 binding to (or reduced binding to) the different isoform (e.g., a different glycoform). Thus, 
the present invention provides an antibody (preferably a monoclonal antibody) that binds 
with greater affinity (preferably at least 2-fold, more preferably at least 5-fold still more 
preferably at least 10-fold greater affinity) to a BPI than to a different isoform or isoforms 
(e.g., glyco forms) of the BPI. 

20 Polyclonal antibodies which may be used in the methods of the invention are 

heterogeneous populations of antibody molecules derived from the sera of immunized 
animals. Unfractionated immune serum can also be used. Various procedures known in the 
art may be used for the production of polyclonal antibodies to a BPI, a fragment of a BPI, a 
BPI-related polypeptide, or a fragment of a BPI-related polypeptide. In a particular 
25 embodiment, rabbit polyclonal antibodies to an epitope of a BPI or a BPI-related 

polypeptide can be obtained. For example, for the production of polyclonal or monoclonal 
antibodies, various host animals can be immunized by injection with the native or a 
synthetic (e.g., recombinant) version of a BPI, a fragment of a BPI, a BPI-related 
polypeptide, or a fragment of a BPI-related polypeptide, including but not limited to rabbits, 
30 mice, rats, etc. The Preferred Technology described herein provides isolated BPIs suitable 
for such immunization. If the BPI is purified by gel electrophoresis, the BPI can be used for 
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immunization with or without prior extraction from the polyacrylamide gel. Various 
adjuvants may be used to enhance the immunological response, depending on the host 
species, including, but not limited to, complete or incomplete Freund's adjuvant, a mineral 
gel such as aluminum hydroxide, surface active substance such as Iysolecithin, pluronic 
polyol, a polyanion, a peptide, an oil emulsion, keyhole limpet hemocyanin, dinitrophenol, 
and an adjuvant such as BCG (bacille Calmette-Guerin) or corynebacterium parvum. 
Additional adjuvants are also well known in the art. 

For preparation of monoclonal antibodies (mAbs) directed toward a BPI, a fragment 
of a BPI, a BPI-related polypeptide, or a fragment of a BPI-related polypeptide, any 
technique which provides for the production of antibody molecules by continuous cell lines 
in culture may be used. For example, the hybridoma technique originally developed by 
Kohler and Milstein (1975, Nature 256:495-497), as well as the trioma technique, the human 
B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 4:72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies 
may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass 
thereof. The hybridoma producing the mAbs of the invention may be cultivated in vitro or 
in vivo. In an additional embodiment of the invention, monoclonal antibodies can be 
produced in germ-free animals utilizing known technology (PCT/US90/02545, incorporated 
herein by reference). 

The monoclonal antibodies include but are not limited to human monoclonal 
antibodies and chimeric monoclonal antibodies (eg., human-mouse chimeras). A chimeric 
antibody is a molecule in which different portions are derived from different animal species, 
such as those having a human immunoglobulin constant region and a variable region derived 
from a murine mAb. (See, eg., Cabilly et al., U.S. Patent No. 4,816,567; and Boss et al., 

U.S. Patent No. 4,816397, which are incorporated herein by reference in their entirety.) 
Humanized antibodies are antibody molecules from non-human species having one or more 
complementarily determining regions (CDRs) from the non-human species and a framework 
region from a human immunoglobulin molecule. (See, eg., Queen, U.S. Patent No. 
5,585,089, which is incorporated herein by reference in its entirety.) 

Chimeric and humanized monoclonal antibodies can be produced by recombi nan t 
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DNA techniques known in the art, for example using methods described in PCT Publication 
No. WO 87/02671; European Patent Application 184,187; European Patent Application 
171,496; European Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. 
Patent No. 4,816,567; European Patent Application 125,023; Better et al., 1988, Science 
5 240:1041-1043; Liu et al., 1987, Proc. Natl. Acad. Sci. USA 84:3439-3443; Liu et al., 1987, 

J. Immunol. 139:3521-3526; Sun et al., 1987, Proc. Natl. Acad. Sci. USA 84:214-218; 
Nishimura et al., 1987, Cane. Res. 47:999-1005; Wood et al., 1985, Nature 314:446-449; 
and Shaw et al., 1988, J. Natl. Cancer Inst. 80:1553-1559; Morrison, 1985, Science 
229:1202-1207; Oi et al., 1986, Bio/Techniques 4:214; U.S. Patent 5,225,539; Jones et al., 

10 1986, Nature 321:552-525; Verhoeyan et al. (1988) Science 239:1534; and Beidler et al., 

1988, J. Immunol. 141:4053-4060. 

Completely human antibodies are particularly desirable for therapeutic treatment of 
human subjects. Such antibodies can be produced using transgenic mice which are 
incapable of expressing endogenous immunoglobulin heavy and light chains genes, but 
1 5 which can express human heavy and light chain genes. The transgenic mice are immunized 
in the normal fashion with a selected antigen, e.g., all or a portion of a BPI of the invention. 
Monoclonal antibodies directed against the antigen can be obtained using conventional 
hybridoma technology.. The human immunoglobulin transgenes harbored by the transgenic 
mice rearrange during B cell differentiation, and subsequently undergo class switching and 
20 somatic mutation. Thus, using such a technique, it is possible to produce therapeutically 

useful IgG, IgA, IgM and IgE antibodies. For an overview of this technology for producing 
human antibodies, see Lonberg and Huszar (1995, Int. Rev. Immunol. 13:65-93). For a 
detailed discussion of this technology for producing human antibodies and human 
monoclonal antibodies and protocols for producing such antibodies, see, e.g ., U.S. Patent 
25 5,625,126; U.S. Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. 

Patent 5,545,806. In addition, companies such as Abgenix, Inc. (Freemont, CA) and 
Genpharm (San Jose, CA) can be engaged to provide human antibodies directed against a 
selected antigen using technology similar to that described above. 

Completely human antibodies which recognize a selected epitope can be generated 
30 using a technique referred to as "guided selection." In this approach a selected non-human 
monoclonal antibody, e.g., a mouse antibody, is used to guide the selection of a completely 
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human antibody recognizing the same epitope. (Jespers et al. (1994) Bio/technology 
12:899-903). 

The antibodies of the present invention can also be generated using various phage 
display methods known in the art. In phage display methods, functional antibody domains 
are displayed on the surface of phage particles which carry the polynucleotide sequences 
encoding them. In a particular, such phage can be utilized to display antigen binding 
domains expressed from a repertoire or combinatorial antibody library (e.g., human or 
munne). Phage expressing an antigen binding domain that binds the antigen of interest can 
be selected or identified with antigen, e.g ., using labeled antigen or antigen bound or 
captured to a solid surface or bead. Phage used in these methods are typically filamentous 
phage including fd and M13 binding domains expressed from phage with Fab, Fv or 
disulfide stabilized Fv antibody domains recombinantly fused to either the phage gene in or 
gene VIII protein. Phage display methods that can be used to make the antibodies of the 
present invention include those disclosed in Brinkman et al., J. Immunol. Methods 182:41- 
50 (1995); Ames et al., J. Immunol. Methods 184:177-186 (1995); Kettleborough et al., Eur. 
J. Immunol. 24:952-958 (1994); Persic et al., Gene 187 9-18 (1997); Burton et al., Advances 
in Immunology 57:191-280 (1994); PCT Application No. PCT/GB91/01 134; PCT 
Publications WO 90/02809; WO 91/10737; WO 92/01047; WO 92/18619; WO 93/1 1236; 
WO 95/15982; WO 95/20401; and U.S. Patent Nos. 5,698,426; 5,223,409; 5,403,484; 
5,580,717; 5,427,908; 5,750,753; 5,821,047; 5,571,698; 5,427,908; 5,516,637; 5,780,225; 
5,658,727; 5,733,743 and 5,969,108; each of which is incorporated herein by reference in its 
entirety. 

As described in the above references, after phage selection, the antibody coding 
regions from the phage can be isolated and used to generate whole antibodies, including 
human antibodies, or any other desired antigen binding fragment, and expressed in any 
desired host, including mammalian cells, insect cells, plant cells, yeast, and bacteria, e.g., as 
described in detail below. For example, techniques to recombinantly produce Fab, Fab’ and 
F(ab )2 fragments can also be employed using methods known in the art such as those 
disclosed in PCT publication WO 92/22324; Mullinax et al., BioTechniques 12(6):864-869 
(1992); and Sawai et al., AJRI 34:26-34 (1995); and Better et al.. Science 240:1041-1043 
(1988) (said references incorporated by reference in their entireties). 
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Examples of techniques which can be used to produce single-chain Fvs and 
antibodies include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al. 
Methods in Enzymology 203:46-88 (1991); Shu et al, PNAS 90:7995-7999 (1993); and 
Skerra et al, Science 240:1038-1040 (1988). 

5 The invention further provides for the use of bispecific antibodies, which can be 

made by methods known in the art. Traditional production of full length bispecific 
antibodies is based on the coexpression of two immunoglobulin heavy chain-light chain 
pairs, where the two chains have different specificities (Milstein et al., 1983, Nature l 
305:537-539). Because of the random assortment of immunoglobulin heavy and light 
10 chains, these hybridomas (quadromas) produce a potential mixture of 1 0 different antibody 
molecules, of which only one has the correct bi specific structure. Purification of the correct 
molecule, which is usually done by affinity chromatography steps, is rather cumbersome, 
and the product yields are low. Similar procedures are disclosed in WO 93/08829, published 
13 May 1993, and in Traunecker et al., 1991, EMBO J. 10:3655-3659 . 

1 5 According to a different and more preferred approach, antibody variable domains 

with the desired binding specificities (antibody-antigen combining sites) are fused to 
immunoglobulin constant domain sequences. The fusion preferably is with an | 

immunoglobulin heavy chain constant domain, comprising at least part of the hinge, CH2, 
and CH3 regions. It is preferred to have the first heavy-chain constant region (CHI) 

20 containing the site necessary for light chain binding, present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are 
co-transfected into a suitable host organism. This provides for great flexibility in adjusting 
the mutual proportions of the three polypeptide fragments in embodiments when unequal 
25 ratios of the three polypeptide chains used in the construction provide the optimum yields. It 
is, however, possible to insert the coding sequences for two or all three polypeptide chains in 
one expression vector when the expression of at least two polypeptide chains in equal ratios 
results in high yields or when the ratios are of no particular significance. 

In a preferred embodiment of this approach, the bispecific antibodies are composed 
30 of a hybrid immunoglobulin heavy chain with a first binding specificity in one arm, and a 

hybrid immunoglobulin heavy chain-light chain pair (providing a second binding specificity) 
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in the other arm. It was found that this asymmetric structure facilitates the separation of the 
desired bispecific compound from unwanted immunoglobulin chain combinations, as the 
presence of an immunoglobulin light chain in only one half of the bispecific molecule 
provides for a facile way of separation. This approach is disclosed in WO 94/04690 
published March 3,1994. For further details for generating bispecific antibodies see, for 
example, Suresh et al.. Methods in Enzymology,1986, 121:210. 

The invention provides functionally active fragments, derivatives or analogs of the 
anti-BPI immunoglobulin molecules. Functionally active means that the fragment, 
derivative or analog is able to elicit anti-anti -idio type antibodies (i.e., tertiary antibodies) 
that recognize the same antigen that is recognized by the antibody from which the fragment, 
derivative or analog is derived. Specifically, in a preferred embodiment the antigenicity of 
the idiotype of the immunoglobulin molecule may be enhanced by deletion of framework 
and CDR sequences that are C-terminal to the CDR sequence that specifically recognizes the 
antigen. To determine which CDR sequences bind the antigen, synthetic peptides containing 
the CDR sequences can be used in binding assays with the antigen by any binding assay 
method known in the art. 

The present invention provides antibody fragments such as, but not limited to, F(ab% 
fragments and Fab fragments. Antibody fragments which recognize specific epitopes may 
be generated by known techniques. F(ab’) 2 fragments consist of the variable region, the light 
chain constant region and the CHI domain of the heavy chain and are generated by pepsin 
digestion of the antibody molecule. Fab fragments are generated by reducing the disulfide 
bridges of the F(ab% fragments. The invention also provides heavy chain and light chain 
dimers of the antibodies of the invention, or any minimal fragment thereof such as Fvs or 
single chain antibodies (SCAs) ( e.g ., as described in U.S. Patent 4,946,778; Bird, 1988, 
Science 242:423-42; Huston et al., 1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and 
Ward et al., 1989, Nature 334:544-54), or any other molecule with the same specificity as 
the antibody of the invention. Single chain antibodies are formed by linking the heavy and 
light chain fragments of the Fv region via an amino acid bridge, resulting in a single chain 
polypeptide. Techniques for the assembly of functional Fv fragments in E. coli may be used 
(Skerra et al., 1988, Science 242:1038-1041). 

In other embodiments, the invention provides fusion proteins of the 
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immunoglobulins of the invention (or functionally active fragments thereof), for example in 
which the immunoglobulin is fused via a covalent bond (e.g., a peptide bond), at either the 
N-terminus or the C-terminus to an amino acid sequence of another protein (or portion 
thereof, preferably at least 10, 20 or 50 amino acid portion of the protein) that is not the 
5 immunoglobulin. Preferably the immunoglobulin, or fragment thereof, is covalently linked 
to the other protein at the N-terminus of the constant domain. As stated above, such fusion 
proteins may facilitate purification, increase half-life in vivo, and enhance the delivery of an 
antigen across an epithelial barrier to the immune system. 

The immunoglobulins of the invention include analogs and derivatives that are either 
10 modified, i.e, by the covalent attachment of any type of molecule as long as such covalent 
attachment that does not impair immunospecific binding. For example, but not by way of 
limitation, the derivatives and analogs of the immunoglobulins include those that have been 
further modified, e.g., by glycosylation, acetylation, pegylation, phosphylation, amidation, 
derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to a 
1 5 cellular ligand or other protein, etc. Any of numerous chemical modifications may be 

carried out by known techniques, including, but not limited to specific chemical cleavage, 
acetylation, formylation, etc. Additionally, the analog or derivative may contain one or 
more non-classical amino acids. 

The foregoing antibodies can be used in methods known in the art relating to the 
20 localization and activity of the BPIs of the invention, eg., for imaging these proteins, 

measuring levels thereof in appropriate physiological samples, in diagnostic methods, etc, 

5.10 Expression Of Antibodies 

The antibodies of the invention can be produced by any method known in the art for 
25 the synthesis of antibodies, in particular, by chemical synthesis or by recombinant 
expression, and are preferably produced by recombinant expression technique. 

Recombinant expression of antibodies, or fragments, derivatives or analogs thereof, 
requires construction of a nucleic acid that encodes the antibody. If the nucleotide sequence 
of the antibody is known, a nucleic acid encoding the antibody may be assembled from 
30 chemically synthesized oligonucleotides (e.g., as described in Kutmeier et al., 1994, 
BioTechniques 17:242), which, briefly, involves the synthesis of overlapping 
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oligonucleotides containing portions of the sequence encoding antibody, annealing and 
ligation of those oligonucleotides, and then amplification of the ligated oligonucleotides by 
PCR. 

Altematively, the nucleic acid encoding the antibody may be obtained by cloning the 
antibody. If a clone containing the nucleic acid encoding the particular antibody is not 
available, but the sequence of the antibody molecule is known, a nucleic acid encoding the 
antibody may be obtained from a suitable source (e.g. t an antibody cDNA library, or cDNA 
library generated from any tissue or cells expressing the antibody) by PCR amplification 
using synthetic primers hybridizable to the 3' and 5' ends of the sequence or by cloning using 
an oligonucleotide probe specific for the particular gene sequence. 

If an antibody molecule that specifically recognizes a particular antigen is not 
available (or a source for a cDNA library for cloning a nucleic acid encoding such an 
antibody), antibodies specific for a particular antigen may be generated by any method 
known in the art, for example, by immunizing an animal, such as a rabbit, to generate 
polyclonal antibodies or, more preferably, by generating monoclonal antibodies. 
Alternatively, a clone encoding at least the Fab portion of the antibody may be obtained by 
screening Fab expression libraries (e.g., as described in Huse et al., 1989, Science 246:1275- 
1281) for clones of Fab fragments that bind the specific antigen or by screening antibody 
libraries (See, e.g, Clackson et al., 1991, Nature 352:624; Hane et al., 1997 Proc. Natl. 

Acad. Sci. USA 94:4937). 

Once a nucleic acid encoding at least the variable domain of the antibody molecule is 
obtained, it may be introduced into a vector containing the nucleotide sequence encoding the 
constant region of the antibody molecule (see, e.g., PCT Publication WO 86/05807; PCT 
Publication WO 89/01036; and U.S. Patent No. 5,122,464). Vectors containing the 
complete light or heavy chain for co-expression with the nucleic acid to allow the expression 
of a complete antibody molecule are also available. Then, the nucleic acid encoding the 
antibody can be used to introduce the nucleotide substitution(s) or deletion(s) necessary to 
substitute (or delete) the one or more variable region cysteine residues participating in an 
intrachain disulfide bond with an amino acid residue that does not contain a sulfhydyl group. 
Such modifications can be earned out by any method known in the art for the introduction 
of specific mutations or deletions in a nucleotide sequence, for example, but not limited to. 
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chemical mutagenesis, in vitro site directed mutagenesis (Hutchinson et al., 1978, J. Biol. 
Chem. 253:6551), PCT based methods, etc. 

In addition, techniques developed for the production of "chimeric antibodies" 
(Morrison et al., 1984, Proc. Natl. Acad. Sci. 81:851-855; Neuberger et al., 1984, Nature 
5 312:604-608; Takeda et al., 1985, Nature 314:452-454) by splicing genes from a mouse 

antibody molecule of appropriate antigen specificity together with genes from a human 
antibody molecule of appropriate biological activity can be used. As described supra , a 
chimeric antibody is a molecule in which different portions are derived from different 
animal species, such as those having a variable region derived from a murine mAb and a 
1 0 human antibody constant region, e.g . , humanized antibodies. 

Once a nucleic acid encoding an antibody molecule of the invention has been 
obtained, the vector for the production of the antibody molecule may be produced by 
recombinant DNA technology using techniques well known in the art. Thus, methods for 
preparing the protein of the invention by expressing nucleic acid containing the antibody 
15 molecule sequences are described herein. Methods which are well known to those skilled in 
the art can be used to construct expression vectors containing an antibody molecule coding 
sequences and appropriate transcriptional and translational control signals. These methods 
include, for example, in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. See, for example, the techniques described in Sambrook et al. 

20 (1990, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory, 

Cold Spring Harbor, NY) and Ausubel et al. (eds., 1 998, Current Protocols in Molecular 
Biology, John Wiley & Sons, NY). 

The expression vector is transferred to a host cell by conventional techniques and the 
transfected cells are then cultured by conventional techniques to produce an antibody of the 
25 invention. 

The host cells used to express a recombinant antibody of the invention may be either 
bacterial cells such as Escherichia coli, or, preferably, eukaryotic cells, especially for the 
expression of whole recombinant antibody molecule. In particular, mammalian cells such as 
Chinese hamster ovary cells (CHO), in conjunction with a vector such as the major 
30 intermediate early gene promoter element from human cytomegalovirus is an effective 

expression system for antibodies (Foecking et al., 198, Gene 45:101; Cockett et al., 1990, 
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Bio/Technology 8:2). 

A variety of host-expression vector systems may be utilized to express an antibody 
molecule of the invention. Such host-expression systems represent vehicles by which the 
coding sequences of interest may be produced and subsequently purified, but also represent 
cells which may, when transformed or transfected with the appropriate nucleotide coding 
sequences, express the antibody molecule of the invention in situ. These include but are not 
limited to microorganisms such as bacteria (e.g., E. coli, B. subtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
containing antibody coding sequences; yeast (e.g., Saccharomyces, Pichia) transformed with 
recombinant yeast expression vectors containing antibody coding sequences; insect cell 
systems infected with recombinant virus expression vectors (e.g., baculovirus) containing 
the antibody coding sequences; plant cell systems infected with recombinant virus 
expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
transformed with recombinant plasmid expression vectors (e.g., Ti plasmid) containing 
antibody coding sequences; or mammalian cell systems (e.g., COS, CHO, BHK, 293, 3T3 
cells) harboring recombinant expression constructs containing promoters derived from the 
genome of mammalian cells (e.g., metallothionein promoter) or from mammalian viruses 
(e.g., the adenovirus late promoter; the vaccinia vims 7.5K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the antibody molecule being expressed. For example, 
when a large quantity of such a protein is to be produced, for the generation of 
pharmaceutical compositions comprising an antibody molecule, vectors which direct the 
expression of high levels of fusion protein products that are readily purified may be 
desirable. Such vectors include, but are not limited, to the E. coli expression vector pUR278 
(Ruther et al., 1983, EMBO J. 2:1791), in which the antibody coding sequence may be 
ligated individually into the vector in frame with the lac Z coding region so that a fusion 
protein is produced; pIN vectors (Inouye & Inouye, 1985, Nucleic Acids Res. 13:3101-3109; 
Van Heeke & Schuster, 1989, J. Biol. Chem. 24:5503-5509); and the like. pGEX vectors 
may also be used to express foreign polypeptides as fusion proteins with glutathione S- 
transferase (GST). In general, such fusion proteins are soluble and can easily be purified 
from lysed cells by adsorption and binding to a matrix glutathione-agarose beads followed 
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by elution in the presence of free glutathione. The pGEX vectors are designed to include 
thrombin or factor Xa protease cleavage sites so that the cloned target gene product can be 
released from the GST moiety. 

In an insect system, Autographa califomica nuclear polyhedrosis virus (AcNPV) is 
5 used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. 
The antibody coding sequence may be cloned individually into non-essential regions (for 
example the polyhedrin gene) of the virus and placed under control of an AcNPV promoter 
(for example the polyhedrin promoter). In mammalian host cells, a number of viral-based 
expression systems (< e.g ., an adenovirus expression system) may be utilized. 

10 As discussed above, a host cell strain may be chosen which modulates the expression 

of the inserted sequences, or modifies and processes the gene product in the specific fashion 
desired. Such modifications (e.g, glycosylation) and processing (e.g, cleavage) of protein 
products may be important for the function of the protein. 

For long-term, high-yield production of recombinant antibodies, stable expression is 
1 5 preferred. For example, cells lines that stably express an antibody of interest can be 
produced by transfecting the cells with an expression vector comprising the nucleotide 
sequence of the antibody and the nucleotide sequence of a selectable (e.g., neomycin or 
hygromycin), and selecting for expression of the selectable marker. Such engineered cell 
lines may be particularly useful in screening and evaluation of compounds that interact 
20 directly or indirectly with the antibody molecule. 

The expression levels of the antibody molecule can be increased by vector 
amplification (for a review, see Bebbington and Hentschel, The use o f vectors based on gene 
amplification for the expression of cloned genes in mammalian cells in DNA c loning, Vol.3. 
(Academic Press, New York, 1987)). When a marker in the vector system expressing 
25 antibody is amplifiable, increase in the level of inhibitor present in culture of host cell will 
increase the number of copies of the marker gene. Since the amplified region is associated 
with the antibody gene, production of the antibody will also increase (Crouse et al., 1983, 
Mol. Cell. Biol. 3:257). 

The host cell may be co-transfected with two expression vectors of the invention, the 
30 first vector encoding a heavy chain derived polypeptide and the second vector encoding a 
light chain derived polypeptide. The two vectors may contain identical selectable markers 
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which enable equal expression of heavy and light chain polypeptides. Alternatively, a single 
vector may be used which encodes both heavy and light chain polypeptides. In such 
situations, the light chain should be placed before the heavy chain to avoid an excess of toxic 
free heavy chain (Proudfoot, 1986, Nature 322:52; Kohler, 1980, Proc. Natl. Acad, Sci. USA 
77.2197). The coding sequences for the heavy and light chains may comprise cDNA or 
genomic DNA. 

Once the antibody molecule of the invention has been recombinantly expressed, it 
may be purified by any method known in the art for purification of an antibody molecule, for 
example, by chromatography ( e.g ., ion exchange chromatography, affinity chromatography 
such as with protein A or specific antigen, and sizing column chromatography), 
centrifugation, differential solubility, or by any other standard technique for the purification 
of proteins. 

Alternatively, any fusion protein may be readily purified by utilizing an antibody 
specific for the fusion protein being expressed. For example, a system described by 
Janknecht et al. allows for the ready purification of non-denatured fusion proteins expressed 
in human cell lines (Janknecht et al., 1991, Proc. Natl. Acad. Sci. USA 88:8972-897). In 
this system, the gene of interest is subcloned into a vaccinia recombination plasmid such that 
the open reading frame of the gene is translationally fused to an amino-terminal tag 
consisting of six histidine residues. The tag serves as a matrix binding domain for the fusion 
protein. Extracts from cells infected with recombinant vaccinia virus are loaded onto Ni2+ 
nitriloacetic acid-agarose columns and histidine-tagged proteins are selectively eluted with 
imidazole-containing buffers. 
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5.11 Conjugated Antibodies 

In a preferred embodiment, anti-BPI antibodies or fragments thereof are conjugated 
to a diagnostic or therapeutic moiety. The antibodies can be used for diagnosis or to 
determine the efficacy of a given treatment regimen. Detection can be facilitated by 
5 coupling the antibody to a detectable substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bio luminescent materials-, radioactive nuclides, positron emitting metals (for use in positron 
emission tomography), and nonradioactive paramagnetic metal ions. See generally U.S. 
Patent No. 4,741,900 for metal ions which can be conjugated to antibodies for use as 
10 diagnostics according to the present invention. Suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, beta-galactosidase, or acetylcholinesterase; suitable 
prosthetic groups include streptavidin, avidin and biotin; suitable fluorescent materials 
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride and phycoerythnn; suitable luminescent 
15 materials include luminol; suitable bioluminescent materials include luciferase, luciferin, 
and aequorin; and suitable radioactive nuclides include 125^ 13 1 j s 1 1 1 In and ^^Tc. 

An anti-BPI antibodies or fragments thereof can be conjugated to a therapeutic agent 
or drug moiety to modify a given biological response. The therapeutic agent or drug moiety 
is not to be construed as limited to classical chemical therapeutic agents. For example, the 
20 drug moiety may be a protein or polypeptide possessing a desired biological activity. Such 
proteins may include, for example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or 
diphtheria toxin; a protein such as tumor necrosis factor, a-interferon, p-interferon, nerve 
growth factor, platelet derived growth factor, tissue plasminogen activator, a thrombotic 
agent or an anti-angiogenic agent, e.g., angiostatin or endostatin; or, a biological response 
25 modifier such as a lymphokine, interleukin-1 (IL-1), interleukin-2 (IL-2), interleukin-6 (IL- 
6), granulocyte macrophage colony stimulating factor (GM-CSF), granulocyte colony 
stimulating factor (G-CSF), nerve growth factor (NGF) or other growth factor. 

Techniques for conjugating such therapeutic moiety to antibodies are well known, 
see, e.g. y Amon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 
30 Therapy", in Monoclonal Antibodies And Cancer Therapy; Reisfeld et al. (eds.), pp. 243-56 
(Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in Controlled 
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Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, Inc. 1987); 
Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in 
Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 
475-506 (1985); "Analysis, Results, And Future Prospective Of The Therapeutic Use Of 
Radiolabeled Antibody In Cancer Therapy", in Monoclonal Antibodies For Cancer 
Detection And Therapy, Baldwin et al. (eds.), pp. 303-16 (Academic Press 1985), and 
Thorpe et al., "The Preparation And Cytotoxic Properties Of Antibody-Toxin Conjugates", 
Immunol. Rev., 62:1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

An antibody with or without a therapeutic moiety conjugated to it can be used as a 
therapeutic that is administered alone or in combination with cytotoxic factor(s) and/or 
cytokine(s). 

5.12 Diagnosis of Breast Cancer 

In accordance with the present invention, test samples of tissue, serum, plasma or 
urine obtained from a subject suspected of having or known to have breast cancer can be 
used for diagnosis or monitoring, or in identifying patients most likely to respond to specific 
therapeutic treatments. In one embodiment, a decreased abundance of one or more BFs or 
BPIs (or any combination of them) in a test sample relative to a control sample (from a 
subject or subjects free from breast cancer) or a previously determined reference range 
indicates the presence of breast cancer; BFs and BPIs suitable for this purpose are identified 
in Tables I, III, VI and VII respectively, as described in detail above. In another 
embodiment of the invention, an increased abundance of one or more BFs or BPIs (or any 
combination of them) in a test sample compared to a control sample or a previously 
determined reference range indicates the presence of breast cancer; BFs and BPIs suitable 
for this purpose are identified in Tables II, IV and Vm, respectively, as described in detail 
above. In another embodiment, the relative abundance of one or more BFs or BPIs (or any 
combination of them) in a test sample compared to a control sample or a previously 
determined reference range indicates a subtype of breast cancer (e.g., familial or sporadic 
breast cancer). In yet another embodiment, the relative abundance of one or more BFs or 
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BPIs (or any combination of them) in a test sample relative to a control sample or a 
previously determined reference range indicates the degree or severity of breast cancer. In 
any of the aforesaid methods, detection of one or more BPIs described herein may optionally 
be combined with detection of one or more additional biomarkers for breast cancer. Any 
5 suitable method in the art can be employed to measure the level of BFs and BPIs, including 
but not limited to the Preferred Technology described herein, kinase assays, immunoassays 
to detect and/or visualize the BPI (e.g., Western blot, immunoprecipitation followed by 
sodium dodecyl sulfate polyacrylamide gel electrophoresis, immunocytochemistry, etc.). In 
cases where a BPI has a known function, an assay for that function may be used to measure 
10 BPI expression. In a further embodiment, a decreased abundance of mRNA including one or 
more BPIs identified in Table VI or VII (or any combination of them) in a test sample 
relative to a control sample or a previously determined reference range indicates the 
presence of breast cancer. In yet a further embodiment, an increased abundance of mRNA 
encoding one or more BPIs identified in Table Vm or IX (or any combination of them) in a 
1 5 test sample relative to a control sample or previously determined reference range indicates 

the presence of breast cancer. Any suitable hybridization assay can be used to detect BPI 
expression by detecting and/or visualizing mRNA encoding the BPI (e.g., Northern assays, 
dot blots, in situ hybridization, etc.). 

In another embodiment of the invention, labeled antibodies, derivatives and analogs 
20 thereof, which specifically bind to a BPI can be used for diagnostic purposes to detect, 

diagnose, or monitor breast cancer. Preferably, breast cancer is detected in an animal, more 
preferably in a mammal and most preferably in a human. 

5.13 Screening Assays 

25 The invention provides methods for identifying agents ( e.g candidate compounds or 

test compounds) that bind to a BPI or have a stimulatory or inhibitory effect on the 
expression or activity of a BPI. The invention also provides methods of identifying agents, 
candidate compounds or test compounds that bind to a BPI-related polypeptide or a BPI 
fusion protein or have a stimulatory or inhibitory effect on the expression or activity of a 
30 BPI-related polypeptide or a BPI fusion protein. Examples of agents, candidate compounds 
or test compounds include, but are not limited to, nucleic acids (e.g., DNA and RNA), 
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carbohydrates, lipids, proteins, peptides, peptidomimetics, small molecules and other drugs. 
Agents can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is limited to peptide libraries, 
while the other four approaches are applicable to peptide, non-peptide oligomer or small 
molecule libraries of compounds (Lam, 1997, Anticancer Drug Des. 12:145; U.S. Patent 
No. 5,738,996; and U.S. Patent No.5,807,683, each of which is incorporated herein in its 
entirety by reference). 

Examples of methods for the synthesis of molecular libraries can be found in the art, 
for example in: DeWitt et al., 1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb et al., 1994, 
Proc. Natl. Acad. Sci. USA 91:1 1422; Zuckermann et al., 1994, J. Med. Chem. 37:2678; 

Cho et al., 1993, Science 261:1303; Carrell et al., 1994, Angew. Chem. Int. Ed. Engl. 
33:2059; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et al., 1994, 

J. Med. Chem. 37: 1233, each of which is incorporated herein in its entirety by reference. 

Libraries of compounds may be presented, e.g. t presented in solution (e.g., Houghten, 
1992, Bio/Techniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips 
(Fodor, 1993, Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (Patent 
Nos. 5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al., 1992, Proc. Natl. Acad. 

Sci. USA 89:1865-1869) or phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 

1990, Science 249:404-406; Cwirla et al., 1990, Proc. Natl. Acad. Sci. USA 87:6378-6382; 
and Felici, 1991, J. Mol. Biol. 222:301-310), each of which is incorporated herein in its 
entirety by reference. 

In one embodiment, agents that interact with (i.e., bind to) a BPI, a BPI fragment 
(e.g. a functionally active fragment), a BPI-related polypeptide, a fragment of a BPI-related 
polypeptide, or a BPI fusion protein are identified in a cell-based assay system. In 
accordance with this embodiment, cells expressing a BPI, a fragment of a BPI, a BPI-related 
polypeptide, a fragment of a BPI-related polypeptide, or a BPI fusion protein are contacted 
with a candidate compound or a control compound and the ability of the candidate 
compound to interact with the BPI is determined. If desired, this assay may be used to 
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screen a plurality ( e.g . a library) of candidate compounds. The cell, for example, can be of 
prokaryotic origin (e.g., E. coli ) or eukaryotic origin (e.g., yeast or mammalian). Further, 
the cells can express the BPI, fragment of the BPI, BPI-related polypeptide, a fragment of 
the BPI-related polypeptide, or a BPI fusion protein endogenously or be genetically 
5 engineered to express the BPI, fragment of the BPI, BPI-related polypeptide, a fragment of 
the BPI-related polypeptide, or a BPI fusion protein. In certain instances, the BPI, fragment 
of the BPI, BPI-related polypeptide, a fragment of the BPI-related polypeptide, or a BPI 
fusion protein or the candidate compound is labeled, for example with a radioactive label 
(such as 32 P, 35 S or 125 I) or a fluorescent label (such as fluorescein isothiocyanate, rhodamine, 

1 0 phycoerythrin, phycocyanin, allophycocyanin, o^hthaldehyde or fluorescamine) to enable 
detection of an interaction between a BPI and a candidate compound. The ability of the 
candidate compound to interact directly or indirectly with a BPI, a fragment of a BPI, a BPI- 
related polypeptide, a fragment of a BPI-related polypeptide, or a BPI fusion protein can be 
determined by methods known to those of skill in the art. For example, the interaction 
1 5 between a candidate compound and a BPI, a fragment of a BPI, a BPI-related polypeptide, a 
fragment of a BPI-related polypeptide, or a BPI fusion protein can be determined by flow 
cytometry, a scintillation assay, immunoprecipitation or western blot analysis. 

In another embodiment, agents that interact with (i.e. y bind to) a BPI, a BPI fragment 
(e.g., a functionally active fragment) a BPI-related polypeptide, a fragment of a BPI-related 
20 polypeptide, or a BPI fusion protein are identified in a cell-free assay system. In accordance 
with this embodiment, a native or recombinant BPI or fragment thereof, or a native or 
recombinant BPI-related polypeptide or fragment thereof, or a BPI-fusion protein or 
fragment thereof, is contacted with a candidate compound or a control compound and the 
ability of the candidate compound to interact with the BPI or BPI-related polypeptide, or 
25 BPI fusion protein is determined. If desired, this assay may be used to screen a plurality 
(e.g. a library) of candidate compounds. Preferably, the BPI, BPI fragment, BPI-related 
polypeptide, a fragment of a BPI-related polypeptide, or a BPI-fusion protein is first 
immobilized, by, for example, contacting the BPI, BPI fragment, BPI-related polypeptide, a 
fragment of a BPI-related polypeptide, or a BPI fusion protein with an immobilized antibody 
30 which specifically recognizes and binds it, or by contacting a purified preparation of the 

BPI, BPI fragment, BPI-related polypeptide, fragment of a BPI-related polypeptide, or a BPI 
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fusion protein with a surface designed to bind proteins. The BPI, BPI fragment, BPI-related 
polypeptide, a fragment of a BPI-related polypeptide, or a BPI fusion protein may be 
partially or completely purified (e.g. y partially or completely free of other polypeptides) or 
part of a cell lysate. Further, the BPI, BPI fragment, BPI-related polypeptide, a fragment of 
a BPI-related polypeptide may be a fusion protein comprising the BPI or a biologically 
active portion thereof, or BPI-related polypeptide and a domain such as glutathionine-S- 
transferase. Alternatively, the BPI, BPI fragment, BPI-related polypeptide, fragment of a 
BPI-related polypeptide or BPI fusion protein can be biotinylated using techniques well 
known to those of skill in the art (e.g., biotinylation kit, Pierce Chemicals; Rockford, IL). 

The ability of the candidate compound to interact with a BPI, BPI fragment, BPI-related 
polypeptide, a fragment of a BPI-related polypeptide, or a BPI fusion protein can be can be 
determined by methods known to those of skill in the art. 

In another embodiment, a cell-based assay system is used to identify agents that bind 
to or modulate the activity of a protein, such as an enzyme, or a biologically active portion 
thereof, which is responsible for the production or degradation of a BPI or is responsible for 
the post- translational modification of a BPI. In a primary screen, a plurality (e.g. y a library) 
of compounds are contacted with cells that naturally or recombinantly express: (i) a BPI, an 
isoform of a BPI, a BPI homolog a BPI-related polypeptide, a BPI fusion protein, or a 
biologically active fragment of any of the foregoing; and (ii) a protein that is responsible for 
processing of the BPI, BPI isoform, BPI homolog, BPI-related polypeptide, BPI fusion 
protein, or fragment in order to identify compounds that modulate the production, 
degradation, or post-translational modification of the BPI, BPI isoform, BPI homolog, BPI- 
related polypeptide, BPI fusion protein or fragment. If desired, compounds identified in the 
primary screen can then be assayed in a secondary screen against cells naturally or 
recombinantly expressing the specific BPI of interest. The ability of the candidate 
compound to modulate the production, degradation or post-translational modification of a 
BPI, isoform, homolog, BPI-related polypeptide, or BPI fusion protein can be determined by 
methods known to those of skill in the art, including without limitation, flow cytometry, a 
scintillation assay, immunoprecipitation and western blot analysis. 

In another embodiment, agents that competitively interact with (z'.e., bind to) a BPI, 

BPI fragment, BPI-related polypeptide, a fragment of a BPI-related polypeptide, or a BPI 
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fusion protein are identified in a competitive binding assay. In accordance with this 
embodiment, cells expressing a BPI, BPI fragment, BPI-related polypeptide, a fragment of a 
BPI-related polypeptide, or a BPI fusion protein are contacted with a candidate compound 
and a compound known to interact with the BPI, BPI fragment, BPI-related polypeptide, a 
5 fragment of a BPI-related polypeptide or a BPI fusion protein; the ability of the candidate 

compound to competitively interact with the BPI, BPI fragment, BPI-related polypeptide, 
fragment of a BPI-related polypeptide, or a BPI fusion protein is then determined. 
Alternatively, agents that competitively interact with (i.e., bind to) a BPI, BPI fragment, 
BPI-related polypeptide or fragment of a BPI-related polypeptide are identified in a cell-free 
10 assay system by contacting a BPI, BPI fragment, BPI-related polypeptide, fragment of a 
BPI-related polypeptide, or a BPI fusion protein with a candidate compound and a 
compound known to interact with the BPI, BPI-related polypeptide or BPI fusion protein. 

As stated above, the ability of the candidate compound to interact with a BPI, BPI fragment, 
BPI-related polypeptide, a fragment of a BPI-related polypeptide, or a BPI fusion protein 
15 can be determined by methods known to those of skill in the art. These assays, whether cell- 
based or cell-free, can be used to screen a plurality (e.g., a library) of candidate compounds. 

In another embodiment, agents that modulate (i.e., upregulate or downregulate) the 
expression of a BPI, or a BPI-related polypeptide are identified by contacting cells (e.g . , \ 

cells of prokaryotic origin or eukaryotic origin) expressing the BPI, or BPI-related 
20 polypeptide with a candidate compound or a control compound (e.g., phosphate buffered 
saline (PBS)) and determining the expression of the BPI, BPI-related polypeptide, or BPI 
fusion protein, mRNA encoding the BPI, or mRNA encoding the BPI-related polypeptide. 
The level of expression of a selected BPI, BPI-related polypeptide, mRNA encoding the 
BPI, or mRNA encoding the BPI-related polypeptide in the presence of the candidate 
25 compound is compared to the level of expression of the BPI, BPI-related polypeptide, 

mRNA encoding the BPI, or mRNA encoding the BPI-related polypeptide in the absence of 
the candidate compound (e.g . , in the presence of a control compound). The candidate 
compound can then be identified as a modulator of the expression of the BPI, or a BPI- | 

related polypeptide based on this comparison. For example, when expression of the BPI or 
30 mRNA is significantly greater in the presence of the candidate compound than in its 

absence, the candidate compound is identified as a stimulator of expression of the BPI or 
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mRNA. Alternatively, when expression of the BPI or mRNA is significantly less in the 
presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of the expression of the BPI or mRNA. The level of expression of 
a BPI or the mRNA that encodes it can be determined by methods known to those of skill in 
the art. For example, mRNA expression can be assessed by Northern blot analysis or 
RT-PCR, and protein levels can be assessed by western blot analysis. 

In another embodiment, agents that modulate the activity of a BPI, or a BPI-related 
polypeptide are identified by contacting a preparation containing the BPI or BPI-related 
polypeptide, or cells (eg., prokaryotic or eukaryotic cells) expressing the BPI or BPI-related 
polypeptide with a test compound or a control compound and determining the ability of the 
test compound to modulate (eg., stimulate or inhibit) the activity of the BPI or BPI-related 
polypeptide. The activity of a BPI or a BPI-related polypeptide can be assessed by detecting 
induction of a cellular signal transduction pathway of the BPI or BPI-related polypeptide 
(eg. , intracellular Ca2+, diacylglycerol, IP3, etc.), detecting catalytic or enzymatic activity 
of the target on a suitable substrate, detecting the induction of a reporter gene ( eg., a 
regulatory element that is responsive to a BPI or a BPI-related polypeptide and is operably 
linked to a nucleic acid encoding a detectable marker, eg., luciferase), or detecting a cellular 
response, for example, cellular differentiation, or cell proliferation. Based on the present 
description, techniques known to those of skill in the art can be used for measuring these 
activities (see, e.g., U.S. Patent No. 5,401,639, which is incorporated herein by reference). 

The candidate compound can then be identified as a modulator of the activity of a BPI or 
BPI-related polypeptide by comparing the effects of the candidate compound to the control 
compound. Suitable control compounds include phosphate buffered saline (PBS) and 
normal saline (NS). 

In another embodiment, agents that modulate (/>., upregulate or downregulate) the 
expression, activity or both the expression and activity of a BPI or BPI-related polypeptide 
are identified in an animal model. Examples of suitable animals include, but are not limited 
to, mice, rats, rabbits, monkeys, guinea pigs, dogs and cats. Preferably, the animal used 
represent a model of breast cancer (e.g., xenografts of human breast cancer cell lines such as 
MDA-MB-345 inestrogen-depreived Severe Combined Immunodeficient (SCID) mice, 

Eccles et al. 1994 Cell Biophysics 24/25, 279). In accordance with this embodiment, the test 
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compound or a control compound is administered ( e.g ., orally, rectally or parenterally such 
as intraperitoneally or intravenously) to a suitable animal and the effect on the expression, 
activity or both expression and activity of the BPI or BPI-related polypeptide is determined. 
Changes in the expression of a BPI or BPI-related polypeptide can be assessed by the 
5 methods outlined above. 

In yet another embodiment, a BPI or BPI-related polypeptide is used as a "bait 
protein" in a two-hybrid assay or three hybrid assay to identify other proteins that bind to or 
interact with a BPI or BPI-related polypeptide (see, e.g., U.S. Patent No. 5,283,317; Zervos 
et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel 
10 et al. (1993) Bio/Techniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; 
and PCT Publication No. WO 94/10300). As those skilled in the art will appreciate, such 
binding proteins are also likely to be involved in the propagation of signals by the BPIs of 
the inventions as, for example, upstream or downstream elements of a signaling pathway 
involving the BPIs of the invention. 

15 In a preferred embodiment, the screens and assays described herein, are used for 

example to screen for or identify a compound that modulates the activity of (or that 
modulates both the expression and activity of) a BPI, BPI analog, or BPI-related 
polypeptide, a fragment of any of the foregoing or a BPI fusion protein. 

This invention further provides novel agents identified by the above-described 
20 screening assays and uses thereof for treatments as described herein. 

5.14 Therapeutic Uses of BPIs 

The invention provides for treatment or prevention of various diseases and disorders 
by administration of a therapeutic compound. Such compounds include but are not limited 
25 to: BPIs, BPI analogs, BPI-related polypeptides and derivatives (including fragments) 

thereof; antibodies to the foregoing; nucleic acids encoding BPIs, BPI analogs, BPI-related 
polypeptides and fragments thereof; antisense nucleic acids to a gene encoding a BPI or 
BPI-related polypeptide; and modulator (e.g., agonists and antagonists) of a gene encoding a 
BPI or BPI-related polypeptide. An important feature of the present invention is the 
30 identification of genes encoding BPIs involved in breast cancer. Breast cancer can be 
treated (e.g. to ameliorate symptoms or to retard onset or progression) or prevented by 
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administration of a therapeutic compound that promotes function or expression of one or 
more BPIs that are decreased in the serum of breast cancer subjects, or by administration of 
a therapeutic compound that reduces function or expression of one or more BPIs that are 
increased in the serum of subjects having breast cancer. 

In one embodiment, one or more antibodies each specifically binding to a BPI are 
administered alone or in combination with one or more additional therapeutic compounds or 
treatments. Examples of such therapeutic compounds or treatments include, but are not 
limited to, taxol, cyclophosphamide, tamoxifen, fluorouracil and doxorubicin. 

Preferably, a biological product such as an antibody is allogeneic to the subject to 
which it is administered. In a preferred embodiment, a huma BPI or a huma BPI-related 
polypeptide, a nucleotide sequence encoding a huma BPI or a huma BPI-related polypeptide, 
or an antibody to a huma BPI or a huma BPI-related polypeptide, is administered to a human 
subject for therapy (e.g. to ameliorate symptoms or to retard onset or progression) or 
prophylaxis. 

5J4.1 Treatment And Prevention of Breast Cancer 

Breast cancer is treated or prevented by administration to a subject suspected of 
having or known to have breast cancer or to be at risk of developing breast cancer of a 
compound that modulates (i.e., increases or decreases) the level or activity (i.e., function) of 
one or more BPIs - or the level of one or more BFs - that are differentially present in the 
serum of subjects having breast cancer compared with serum of subjects free from breast 
cancer. In one embodiment, breast cancer is treated or prevented by administering to a 
subject suspected of having or known to have breast cancer or to be at risk of developing 
breast cancer a compound that upregulates (i.e, increases) the level or activity (i.e, 
function) of one or more BPIs - or the level of one or more BFs - that are decreased in the 
serum of subjects having breast cancer. In another embodiment, a compound is 
administered that upregulates the level or activity (i.e., function) of one or more BPIs - or the 
level of one or more BFs - that are increased in the serum of subjects having breast cancer. 
Examples of such a compound include but are not limited to: BPIs, BPI fragments and BPI- 
related polypeptides; nucleic acids encoding a BPI, a BPI fragment and a BPI-related 
polypeptide (e.g., for use in gene therapy); and, for those BPIs or BPI-related polypeptides 
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with enzymatic activity, compounds or molecules known to modulate that enzymatic 
activity. Other compounds that can be used, eg, BPI agonists, can be identified using in 
vitro assays. 

Breast cancer is also treated or prevented by administration to a subject suspected of 
5 having or known to have breast cancer or to be at risk of developing breast cancer of a 

compound that downregulates the level or activity of one or more BPIs - or the level of one 
or more BFs - that are increased in the serum of subjects having breast cancer. In another 
embodiment, a compound is administered that downregulates the level or activity of one or 
more BPIs - or the level of one or more BFs - that are decreased in the serum of subjects 
10 having breast cancer. Examples of such a compound include, but are not limited to, BPI 

antisense oligonucleotides, ribozymes, antibodies directed against BPIs, and compounds that 
inhibit the enzymatic activity of a BPI. Other useful compounds e.g,, BPI antagonists and 
small molecule BPI antagonists, can be identified using in vitro assays. 

In a preferred embodiment, therapy or prophylaxis is tailored to the needs of an 
15 individual subject. Thus, in specific embodiments, compounds that promote the level or 
function of one or more BPIs, or the level of one or more BFs, are therapeutically or 
prophylactically administered to a subject suspected of having or known to have breast 
cancer, in whom the levels or functions of said one or more BPIs, or levels of said one or 
more BFs, are absent or are decreased relative to a control or normal reference range. In 
20 further embodiments, compounds that promote the level or function of one or more BPIs, or 
the level of one or more BFs, are therapeutically or prophylactically administered to a 
subject suspected of having or known to have breast cancer in whom the levels or functions 
of said one or more BPIs, or levels of said one or more BFs, are increased relative to a 
control or to a reference range. In further embodiments, compounds that decrease the level 
25 or function of one or more BPIs, or the level of one or more BFs, are therapeutically or 

prophylactically administered to a subject suspected of having or known to have breast 
cancer in whom the levels or functions of said one or more BPIs, or levels of said one or 
more BFs, are increased relative to a control or to a reference range. In further 
embodiments, compounds that decrease the level or function of one or more BPIs, or the 
30 level of one or more BFs, are therapeutically or prophylactically administered to a subject 
suspected of having or known to have breast cancer in whom the levels or functions of said 


81 



WO 01/13117 


PCT/GB00/03143 


one or more BPIs, or levels of said one or more BFs, are decreased relative to a control or to 
a reference range. The change in BPI function or level, or BF level, due to the 
administration of such compounds can be readily detected, e.g., by obtaining a sample ( e.g ., 
a sample of serum, blood or urine or a tissue sample such as biopsy tissue) and assaying in 
vitro the levels of said BFs or the levels or activities of said BPIs, or the levels of mRNAs 
encoding said BPIs. or any combination of the foregoing. Such assays can be performed 
before and after the administration of the compound as described herein. 

The compounds of the invention include but are not limited to any compound, e.g. f a 
small organic molecule, protein, peptide, antibody, nucleic acid, etc. that restores the breast 
cancer BPI or BF profile towards normal with the proviso that such compound is not taxol, 
cyclophosphamide, tamoxifen, fluorouracil and doxorubicin. 

5.14.2 Gene Therapy 

In a specific embodiment, nucleic acids comprising a sequence encoding a BPI, a 
BPI fragment, BPI-related polypeptide or fragment of a BPI-related polypeptide, are 
administered to promote BPI function by way of gene therapy. Gene therapy refers to 
administration to a subject of an expressed or expressible nucleic acid. In this embodiment, 
the nucleic acid produces its encoded polypeptide that mediates a therapeutic effect by 
promoting BPI function. 

Any of the methods for gene therapy available in the art can be used according to the 
present invention. Exemplary methods are described below. 

For general reviews of the methods of gene therapy, see Goldspiel et al., 1993, 

Clinical Pharmacy 12:488-505; Wu and Wu, 1991, Biotherapy 3:87-95; Tolstoshev, 1993, 
Ann. Rev. Pharmacol. Toxicol. 32:573-596; Mulligan, 1993, Science 260:926-932; and 
Morgan and Anderson, 1993, Ann. Rev. Biochem. 62:191-217; May, 1993, TIBTECH 
1 1(5). 155-215). Methods commonly known in the art of recombinant DNA technology 
which can be used are described in Ausubel et al. (eds.), 1993, Current Protocols in 
Molecular Biology, John Wiley & Sons, NY; and Kriegler, 1990, Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY. 

In a preferred aspect, the compound comprises a nucleic acid encoding a BPI or 
fragment or chimeric protein thereof, said nucleic acid being part of an expression vector 
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that expresses a BPI or fragment or chimeric protein thereof in a suitable host. In particular, 
such a nucleic acid has a promoter operably linked to the BPI coding region, said promoter 
being inducible or constitutive (and, optionally, tissue-specific). In another particular 
embodiment, a nucleic acid molecule is used in which the BPI coding sequences and any 
5 other desired sequences are flanked by regions that promote homologous recombination at a 
desired site in the genome, thus providing for intrachromosomal expression of the BPI 
nucleic acid (Koller and Smithies, 1989, Proc. Natl. Acad. Sci. USA 86:8932-8935, Zijlstra 
et al., 1989, Nature 342:435-438). 

Delivery of the nucleic acid into a subject may be direct, in which case the subject is 
1 0 directly exposed to the nucleic acid or nucleic acid-carrying vector; this approach is known 
as in vivo gene therapy. Alternatively, delivery of the nucleic acid into the subject may be 
indirect, in which case cells are first transformed with the nucleic acid in vitro and then 
transplanted into the subject; this approach is known as ex vivo gene therapy. 

In a specific embodiment, the nucleic acid is directly administered in vivo , where it is 
1 5 expressed to produce the encoded product. This can be accomplished by any of numerous 
methods known in the art, e.g., by constructing it as part of an appropriate nucleic acid 
expression vector and administering it so that it becomes intracellular, e.g, by infection 
using a defective or attenuated retroviral or other viral vector (see U.S. Patent No. 
4,980,286); by direct injection of naked DNA; by use of microparticle bombardment (e.g, a 
20 gene gun; Biolistic, Dupont); by coating with lipids, cell-surface receptors or transfecting 
agents; by encapsulation in liposomes, microparticles or microcapsules; by administering it 
in linkage to a peptide which is known to enter the nucleus; or by administering it in linkage 
to a ligand subject to receptor-mediated endocytosis (see, e.g, Wu and Wu, 1987, J. Biol. 
Chem. 262:4429-4432), which can be used to target cell types specifically expressing the 
25 receptors. In another embodiment, a nucleic acid-ligand complex can be formed in which 
the ligand comprises a fusogenic viral peptide to disrupt endosomes, allowing the nucleic 
acid to avoid lysosomal degradation. In yet another embodiment, the nucleic acid can be 
targeted in vivo for cell specific uptake and expression, by targeting a specific receptor (see, 
e.g., PCT Publications WO 92/06180 dated April 16, 1992 (Wu et al.); WO 92/22635 dated 
30 December 23, 1992 (Wilson et al.); WO92/203 16 dated November 26, 1992 (Findeis et al.); 
W093/14188 dated July 22, 1993 (Clarke et al.), WO 93/20221 dated October 14, 1993 
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(Young)). Alternatively, the nucleic acid can be introduced intracellularly and incorporated 
within host cell DNA for expression, by homologous recombination (Koller and Smithies, 
1989, Proc. Natl. Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 342:435-438). 

In a specific embodiment, a viral vector that contains a nucleic acid encoding a BPI 
is used. For example, a retroviral vector can be used (see Miller et al., 1 993, Meth, 
Enzymol. 217:581-599). These retroviral vectors have been modified to delete retroviral 
sequences that are not necessary for packaging of the viral genome and integration into host 
cell DNA. The nucleic acid encoding the BPI to be used in gene therapy is cloned into the 
vector, which facilitates delivery of the gene into a subject. More detail about retroviral 
vectors can be found in Boesen et al., 1994, Biotherapy 6:291-302, which describes the use 
of a retroviral vector to deliver the mdrl gene to hematopoietic stem cells in order to make 
the stem cells more resistant to chemotherapy. Other references illustrating the use of 
retroviral vectors in gene therapy are: Clowes et al., 1994, J. Clin. Invest. 93:644-651; Kiem 
et al., 1994, Blood 83:1467-1473; Salmons and Gunzberg, 1993, Human Gene Therapy 
4:129-141; and Grossman and Wilson, 1993, Curr. Opin. in Genetics and Devel. 3:110-1 14. 

Adenoviruses are other viral vectors that can be used in gene therapy. Adenoviruses 
are especially attractive vehicles for delivering genes to respiratory epithelia. Adenoviruses 
naturally infect respiratory epithelia where they cause a mild disease. Other targets for 
adenovirus-based delivery systems are liver, the central nervous system, endothelial cells, 
and muscle. Adenoviruses have the advantage of being capable of infecting non-dividing 
cells. Kozarsky and Wilson, 1993, Current Opinion in Genetics and Development 3:499- 
503 present a review of adenovirus-based gene therapy. Bout et al., 1994, Human Gene 
Therapy 5:3-10 demonstrated the use of adenovirus vectors to transfer genes to the 
respiratory epithelia of rhesus monkeys. Other instances of the use of adenoviruses in gene 
therapy can be found in Rosenfeld et al., 1991, Science 252:431-434; Rosenfeld et al., 1992, 
Cell 68:143-155; Mastrangeli et al., 1993, J. Clin. Invest. 91 :225-234; PCT Publication 
W094/12649; and Wang, et al., 1995, Gene Therapy 2:775-783. 

Adeno-associated virus (AAV) has also been proposed for use in gene therapy 
(Walsh et al., 1993, Proc. Soc. Exp. Biol. Med. 204:289-300; U.S. Patent No. 5,436,146). 

Another approach to gene therapy involves transferring a gene to cells in tissue 
culture by such methods as electroporation, lipofection, calcium phosphate mediated 
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transfection, or viral infection. Usually, the method of transfer includes the transfer of a 
selectable marker to the cells. The cells are then placed under selection to isolate those cells 
that have taken up and are expressing the transferred gene. Those cells are then delivered to 
a subject. 

5 In this embodiment, the nucleic acid is introduced into a cell prior to administration 

in vivo of the resulting recombinant cell. Such introduction can be carried out by any 
method known in the art, including but not limited to transfection, electroporation, 
microinjection, infection with a viral or bacteriophage vector containing the nucleic acid 
sequences, cell fusion, chromosome-mediated gene transfer, microcell-mediated gene 
10 transfer, spheroplast fusion, etc. Numerous techniques are known in the art for the 

introduction of foreign genes into cells (see, e.g. y Loeffler and Behr, 1993, Meth. Enzymol. 
217:599-618; Cohen et al, 1993, Meth. Enzymol. 217:618-644; Cline, 1985, Pharmac. Ther. 
29:69-92) and may be used in accordance with the present invention, provided that the 
necessary developmental and physiological functions of the recipient cells are not disrupted. 

1 5 The technique should provide for the stable transfer of the nucleic acid to the cell, so that 

the nucleic acid is expressible by the cell and preferably heritable and expressible by its cell 
progeny. 

The resulting recombinant cells can be delivered to a subject by various methods 
known in the art. In a preferred embodiment, epithelial cells are injected, e.g. y 
20 subcutaneously. In another embodiment, recombinant skin cells may be applied as a skin 
graft onto the subject. Recombinant blood cells (e.g . , hematopoietic stem or progenitor 
cells) are preferably administered intravenously. The amount of cells envisioned for use 
depends on the desired effect, the condition of the subject, etc., and can be determined by 
one skilled in the art. 

25 Cells into which a nucleic acid can be introduced for purposes of gene therapy 

encompass any desired, available cell type, and include but are not limited to neuronal cells, 
glial cells (e.g . , oligodendrocytes or astrocytes), epithelial cells, endothelial cells, 
keratinocytes, fibroblasts, muscle cells, hepatocytes; blood cells such as T lymphocytes, 

B lymphocytes, monocytes, macrophages, neutrophils, eosinophils, megakaryocytes, 

30 granulocytes; various stem or progenitor cells, in particular hematopoietic stem or progenitor 
cells, e.g. y as obtained from bone marrow, umbilical cord blood, peripheral blood or fetal 


85 



WO 01/13117 


PCT/GB00/03143 


liver. 

In a preferred embodiment, the cell used for gene therapy is autologous to the subject 
that is treated. 

In an embodiment in which recombinant cells are used in gene therapy, a nucleic acid 
encoding a BPI is introduced into the cells such that it is expressible by the cells or their 
progeny, and the recombinant cells are then administered in vivo for therapeutic effect. In a 
specific embodiment, stem or progenitor cells are used. Any stem or progenitor cells which 
can be isolated and maintained in vitro can be used in accordance with this embodiment of 
the present invention (see e.g. PCT Publication WO 94/08598, dated April 28, 1994; 

Stemple and Anderson, 1992, Cell 71:973-985; Rheinwald, 1980, Meth. Cell Bio. 21A:229; 
and Pittelkow and Scott, 1986, Mayo Clinic Proc. 61:771). 

In a specific embodiment, the nucleic acid to be introduced for purposes of gene 
therapy comprises an inducible promoter operably linked to the coding region, such that 
expression of the nucleic acid is controllable by controlling the presence or absence of the 
appropriate inducer of transcription. 

Direct injection of a DNA coding for a BPI may also be performed according to/ for 
example, the techniques described in United States Patent No. 5,589,466. These techniques 
involve the injection of "naked DNA", /.<?., isolated DNA molecules in the absence of 
liposomes, cells, or any other material besides a suitable carrier. The injection of DNA 
encoding a protein and operably linked to a suitable promoter results in the production of the 
protein in cells near the site of injection and the elicitation of an immune response in the 
subject to the protein encoded by the injected DNA. In a prefenred embodiment, naked 
DNA comprising (a) DNA encoding a BPI and (b) a promoter are injected into a subject to 
elicit an immune response to the BPI. 

5.14.3 Inhibition of BPIs to Treat Breast Cancer 

In one embodiment of the invention, breast cancer is treated or prevented by 
administration of a compound that antagonizes (inhibits) the level(s) and/or function(s) of 
one or more BPIs which are elevated in the serum of subjects having breast cancer as 
compared with serum of subjects free from breast cancer. Compounds useful for this 
purpose include but are not limited to anti-BPI antibodies (and fragments and derivatives 
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containing the binding region thereof), BPI antisense or ribozyme nucleic acids, and nucleic 
acids encoding dysfunctional BPIs that are used to "knockout" endogenous BPI function by 
homologous recombination (see, e.g., Capecchi, 1989, Science 244:1288-1292). Other 
compounds that inhibit BPI function can be identified by use of known in vitro assays, e.g., 

5 assays for the ability of a test compound to inhibit binding of a BPI to another protein or a 
binding partner, or to inhibit a known BPI function. Preferably such inhibition is assayed in 
vitro or in cell culture, but genetic assays may also be employed. The Preferred Technology 
can also be used to detect levels of the BPI before and after the administration of the 
compound. Preferably, suitable in vitro or in vivo assays are utilized to determine the effect 
10 of a specific compound and whether its administration is indicated for treatment of the 
affected tissue, as described in more detail below. 

In a specific embodiment, a compound that inhibits a BPI function is administered 
therapeutically or prophylactically to a subject in whom an increased serum level or 
functional activity of the BPI (e.g . , greater than the normal level or desired level) is detected 
1 5 as compared with serum of subjects free from breast cancer or a predetermined reference 
range. Methods standard in the art can be employed to measure the increase in a BPI level 
or function, as outlined above. Preferred BPI inhibitor compositions include small 
molecules, i.e., molecules of 1000 daltons or less. Such small molecules can be identified 
by the screening methods described herein. 

20 

5.14.4 Antisense Regulation of BPIs 

In a specific embodiment, BPI expression is inhibited by use of BPI antisense nucleic 
acids. The present invention provides the therapeutic or prophylactic use of nucleic acids 
comprising at least six nucleotides that are antisense to a gene or cDNA encoding a BPI or a 
25 portion thereof. As used herein, a BPI "antisense" nucleic acid refers to a nucleic acid 

capable of hybridizing by virtue of some sequence complementarity to a portion of an RNA 
(preferably mRNA) encoding a BPI. The antisense nucleic acid may be complementary to a 
coding and/or noncoding region of an mRNA encoding a BPI. Such antisense nucleic acids 
have utility as compounds that inhibit BPI expression, and can be used in the treatment or 
30 prevention of breast cancer. 

The antisense nucleic acids of the invention are double-stranded or single-stranded 
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oligonucleotides, RNA or DNA or a modification or derivative thereof, and can be directly 
administered to a cell or produced intracellularly by transcription of exogenous, introduced 
sequences. 

The invention further provides pharmaceutical compositions comprising an effective 
amount of the BPI antisense nucleic acids of the invention in a pharmaceutically acceptable 
carrier, as described infra . 

In another embodiment, the invention provides methods for inhibiting the expression 
of a BPI nucleic acid sequence in a prokaryotic or eukaryotic cell comprising providing the 
cell with an effective amount of a composition comprising a BPI antisense nucleic acid of 
the invention. 

BPI antisense nucleic acids and their uses are described in detail below. 

5.14.4.1 BPI Antisense Nucleic Acids 

The BPI antisense nucleic acids are of at least six nucleotides and are preferably 
oligonucleotides ranging from 6 to about 50 oligonucleotides. In specific aspects, the 
oligonucleotide is at least 10 nucleotides, at least 15 nucleotides, at least 100 nucleotides, or 
at least 200 nucleotides. The oligonucleotides can be DNA or RNA or chimeric mixtures or 
derivatives or modified versions thereof and can be single-stranded or double-stranded. The 
oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate backbone. 
The oligonucleotide may include other appended groups such as peptides; agents that 
facilitate transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. 

Acad. Sci. USA 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; 

PCT Publication No. WO 88/09810, published December 15, 1988) or blood-brain barrier 
(see, e.g. y PCT Publication No. WO 89/10134, published April 25, 1988); hybridization- 
triggered cleavage agents (see, eg., Krol et al., 1988, BioTechniques 6:958-976) or 
intercalating agents (see, eg, Zon, 1S>88, Pharm. Res. 5:539-549). 

In a preferred aspect of the invention, a BPI antisense oligonucleotide is provided, 
preferably of single-stranded DNA. The oligonucleotide may be modified at any position on 
its structure with substituents generally known in the art. 

The BPI antisense oligonucleotide may comprise at least one of the following 
modified base moieties: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
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hypoxanthine, xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluraciI, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 

1- methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 

5 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 

5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uraci 1-5 -oxy acetic acid (v), wybutoxosine, pseudouracil, queosine, 

2- thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 

10 5-oxyacetic acid mettiylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3- N-2-carboxypropyl) uracil, (acp3)w, 2,6-diaminopurine, and other base analogs. 

In another embodiment, the oligonucleotide comprises at least one modified sugar 
moiety, e.g, one of the following sugar moieties: arabinose, 2-fluoroarabinose, xylulose, and 
hexose. 

15 In yet another embodiment, the oligonucleotide comprises at least one of the 

following modified phosphate backbones: a phosphorothioate, a phosphorodithioate, a 
phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a methylphosphonate, an 
alkyl phosphotriester, a formacetal, or an analog of formacetal. 

In yet another embodiment, the oligonucleotide is an a-anomeric oligonucleotide. An 
20 a-anomeric oligonucleotide forms specific double-stranded hybrids with complementary 

RNA in which, contrary to the usual p-units, the strands run parallel to each other (Gautier et 
al„ 1987, Nucl. Acids Res. 15:6625-6641). 

The oligonucleotide may be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, or hybridization-triggered 
25 cleavage agent. 

Oligonucleotides of the invention may be synthesized by standard methods known in 
the art, e.g., by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides 
may be synthesized by the method of Stein et al. (1988, Nucl. Acids Res. 16:3209), and 
30 methylphosphonate oligonucleotides can be prepared by use of controlled pore glass 
polymer supports (Sarin et al., 1988, Proc. Natl Acad. Sci. USA 85:7448-7451). 
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In a specific embodiment, the BPI antisense nucleic acid of the invention is produced 
intracellularly by transcription from an exogenous sequence. For example, a vector can be 
introduced in vivo such that it is taken up by a cell, within which cell the vector or a portion 
thereof is transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a 
vector would contain a sequence encoding the BPI antisense nucleic acid. Such a vector can 
remain episomal or become chromosomally integrated, as long as it can be transcribed to 
produce the desired antisense RNA. Such vectors can be constructed by recombinant DNA 
technology standard in the art. Vectors can be plasmid, viral, or others known in the art, 
used for replication and expression in mammalian cells. Expression of the sequence 
encoding the BPI antisense RNA can be by any promoter known in the art to act in 
mammalian, preferably human, cells. Such promoters can be inducible or constitutive. 
Examples of such promoters are outlined above. 

The antisense nucleic acids of the invention comprise a sequence complementary to 
at least a portion of an RNA transcript of a gene encoding a BPI, preferably a human gene 
encoding a BPI. However, absolute complementarity, although preferred, is not required. A 
sequence "complementary to at least a portion of an RNA," as referred to herein, means a 
sequence having sufficient complementarity to be able to hybridize under stringent 
conditions (e.g., highly stringent conditions comprising hybridization in 7% sodium dodecyl 
sulfate (SDS), 1 mM EDTA at 65°C and washing in 0.1xSSC/0.1% SDS at 68°C, or 
moderately stringent conditions comprising washing in 0.2xSSC/0.1% SDS at 42°C ) with 
the RNA, forming a stable duplex; in the case of double-stranded API antisense nucleic 
acids, a single strand of the duplex DNA may thus be tested, or triplex formation may be 
assayed. The ability to hybridize will depend on both the degree of complementarity and the 
length of the antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the 
more base mismatches with an RNA encoding a BPI it may contain and still form a stable 
duplex (or triplex, as the case may be). One skilled in the art can ascertain a tolerable degree 
of mismatch by use of standard procedures to determine the melting point of the hybridized 
complex. 

5.14.4.2 Therapeutic Use of BPI Antisense Nucleic Acids 

The BPI antisense nucleic acids can be used to treat or prevent breast cancer when 
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the target BPI is overexpressed in the serum of subjects suspected of having or suffering 
from breast cancer. In a preferred embodiment, a single-stranded DNA antisense BPI 
oligonucleotide is used. 

Cell types which express or overexpress RNA encoding a BPI can be identified by 
5 various methods known in the art. Such cell types include but are not limited to leukocytes 
(e.g., neutrophils, macrophages, monocytes) and resident cells (e.g., astrocytes, glial cells, 
neuronal cells, and ependymal cells). Such methods include, but are not limited to, 
hybridization with a BPI-specific nucleic acid (e.g., by Northern hybridization, dot blot 
hybridization, in situ hybridization), observing the ability of RNA from the cell type to be 
10 translated in vitro into a BPI, immunoassay, etc. In a preferred aspect, primary tissue from a 
subject can be assayed for BPI expression prior to treatment, e.g., by immunocytochemistry 
or in situ hybridization. 

Pharmaceutical compositions of the invention, comprising an effective amount of a 
BPI antisense nucleic acid in a pharmaceutically acceptable carrier, can be administered to a 
1 5 subject having breast cancer. 

The amount of BPI antisense nucleic acid which will be effective in the treatment of 
breast cancer can be determined by standard clinical techniques. 

In a specific embodiment, pharmaceutical compositions comprising one or more BPI 
antisense nucleic acids are administered via liposomes, microparticles, or microcapsules. In 
20 various embodiments of the invention, such compositions may be used to achieve sustained 
release of the BPI antisense nucleic acids. 

5.14.5 Inhibitory Ribozvme And Triple Helix Approaches 

In another embodiment, symptoms of breast cancer may be ameliorated by 
25 decreasing the level of a BPI or BPI activity by using gene sequences encoding the BPI in 
conjunction with well-known gene "knock-out,’ " ribozyme or triple helix methods to 
decrease gene expression of a BPI. In this approach ribozyme or triple helix molecules are 
used to modulate the activity, expression or synthesis of the gene encoding the BPI, and thus 
to ameliorate the symptoms of breast cancer. Such molecules may be designed to reduce or 
30 inhibit expression of a mutant or non-mutant target gene. Techniques for the production and 
use of such molecules are well known to those of skill in the art. 
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Ribozyme molecules designed to catalytically cleave gene mRNA transcripts 
encoding a BPI can be used to prevent translation of target gene mRNA and, therefore, 
expression of the gene product. (See, e.g . , PCT International Publication WO90/1 1364, 
published October 4, 1990; Sarver et al., 1990, Science 247:1222-1225). 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage 
of RNA. (For a review, see Rossi, 1994, Current Biology 4, 469-471). The mechanism of 
ribozyme action involves sequence specific hybridization of the ribozyme molecule to 
complementary target RNA, followed by an endonucleolytic cleavage event. The 
composition of ribozyme molecules must include one or more sequences complementary to 
the target gene mRNA, and must include the well known catalytic sequence responsible for 
mRNA cleavage. For this sequence, see, e.g, U.S. Patent No. 5,093,246, which is 
incorporated herein by reference in its entirety. 

While ribozymes that cleave mRNA at site specific recognition sequences can be 
used to destroy mRNAs encoding an API, the use of hammerhead ribozymes is preferred. 
Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5’-UG-3\ The construction and 
production of hammerhead ribozymes is well known in the art and is described more Hilly in 
Myers, 1995, Molecular Biology and Biotechnology: A Comprehensive Desk Reference. 
VCH Publishers, New York, (see especially Figure 4, page 833) and in Haseloff and 
Gerlach, 1988, Nature, 334, 585-591, each of which is incorporated herein by reference in its 
entirety. 

Preferably the ribozyme is engineered so that the cleavage recognition site is located 
near the 5' end of the mRNA encoding the API, z.e, to increase efficiency and minimize the 
intracellular accumulation of non-functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one that occurs naturally in Tetrahymena 
thermophila (known as the IVS, or L-19 IVS RNA) and that has been extensively described 
by Thomas Cech and collaborators (Zaug, et al., 1984, Science, 224, 574-578; Zaug and 
Cech, 1986, Science, 231, 470-475; Zaug, et al, 1986, Nature, 324, 429-433; published 
International patent application No. WO 88/04300 by University Patents Inc.; Been and 
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Cech, 1986, Cell, 47, 207-216). The Cech-type ribozymes have an eight base pair active site 
which hybridizes to a target RNA sequence whereafter cleavage of the target RNA takes 
place. The invention encompasses those Cech-type ribozymes which target eight base-pair 
active site sequences that are present in the gene encoding the BP1. 

5 As in the antisense approach, the ribozymes can be composed of modified 

oligonucleotides (e.g., for improved stability, targeting, etc.) and should be delivered to cells 
that express the BPI in vivo. A preferred method of delivery involves using a DNA 
construct "encoding" the ribozyme under the control of a strong constitutive pol in or pol II 
promoter, so that transfected cells will produce sufficient quantities of the ribozyme to 
10 destroy endogenous mRNA encoding the BPI and inhibit translation. Because ribozymes, 
unlike antisense molecules, are catalytic, a lower intracellular concentration is required for 
efficacy. 

Endogenous BPI expression can also be reduced by inactivating or "knocking out" 
the gene encoding the BPI, or the promoter of such a gene, using targeted homologous 
15 recombination ( e.g ., see Smithies, et al., 1985, Nature 317:230-234; Thomas and Capecchi, 
1987, Cell 51:503-512; Thompson et al., 1989, Cell 5:313-321; and Zijlstra et al., 1989, 
Nature 342:435-438, each of which is incorporated by reference herein in its entirety). For 
example, a mutant gene encoding a non- functional BPI (or a completely unrelated DNA 
sequence) flanked by DNA homologous to the endogenous gene (either the coding regions 
20 or regulatory regions of the gene encoding the BPI) can be used, with or without a selectable 

marker and/or a negative selectable marker, to transfect cells that express the target gene in 
vivo. Insertion of the DNA construct, via targeted homologous recombination, results in 
inactivation of the target gene. Such approaches are particularly suited in the agricultural 
field where modifications to ES (embryonic stem) cells can be used to generate animal 
25 offspring with an inactive target gene (e.g., see Thomas and Capecchi, 1987 and Thompson, 
1989, supra). However this approach can be adapted for use in humans provided the 
recombinant DNA constructs are directly administered or targeted to the required site in vivo 
using appropriate viral vectors. 

Alternatively, the endogenous expression of a gene encoding a BPI can be reduced 
30 by targeting deoxyribonucleotide sequences complementary to the regulatory region of the 
gene (i.e., the gene promoter and/or enhancers) to form triple helical structures that prevent 
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transcription of the gene encoding the BPI in target cells in the body. (See generally, Helene, 
1991, Anticancer Drug Des., 6(6), 569-584; Helene, et al., 1992, Ann. N.Y. Acad. Sci., 660, 
27-36; and Maher, 1992, Bioassays 14(12), 807-815). 

Nucleic acid molecules to be used in triplex helix formation for the inhibition of 
transcription should be single stTanded and composed of deoxynucleotides. The base 
composition of these oligonucleotides must be designed to promote triple helix formation 
via Hoogsteen base pairing rules, which generally require sizeable stretches of either purines 
or pyrimidines to be present on one strand of a duplex. Nucleotide sequences may be 
pyrimidine-based, which will result in TAT and CGC + triplets across the three associated 
strands of the resulting triple helix. The pyrimidine-rich molecules provide base 
complementarity to a purine-rich region of a single strand of the duplex in a parallel 
orientation to that strand. In addition, nucleic acid molecules may be chosen that are purine- 
rich, for example, contain a stretch of G residues. These molecules will form a triple helix 
with a DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting in GGC triplets across the three 
strands in the triplex. 

Alternatively, the potential sequences that can be targeted for triple helix formation 
may be increased by creating a so called "switchback" nucleic acid molecule. Switchback 
molecules are synthesized in an alternating 5-3', 3’-5' manner, such that they base pair with 
first one strand of a duplex and then the other, eliminating the necessity for a sizeable stretch 
of either purines or pyrimidines to be present on one strand of a duplex. 

In instances wherein the antisense, ribozyme, or triple helix molecules described 
herein are utilized to inhibit mutant gene expression, it is possible that the technique may so 
efficiently reduce or inhibit the transcription (triple helix) or translation (antisense, 
ribozyme) of mRNA produced by normal gene alleles of a BPI that the situation may arise 
wherein the concentration of BPI present may be lower than is necessary for a normal 
phenotype. In such cases, to ensure that substantially normal levels of activity of a gene 
encoding a BPI are maintained, gene therapy may be used to introduce into cells nucleic acid 
molecules that encode and express the BPI that exhibit normal gene activity and that do not 
contain sequences susceptible to whatever antisense, ribozyme, or triple helix treatments are 
being utilized. Alternatively, in instances whereby the gene encodes an extracellular protein, 
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normal BPI can be co-administered in order to maintain the requisite level of BPI activity. 

Antisense RNA and DNA, ribozyme, and triple helix molecules of the invention may 
be prepared by any method known in the art for the synthesis of DNA and RNA molecules, 
as discussed above. These include techniques for chemically synthesizing oligodeoxyri- 
5 bonucleotides and oligoribonucleotides well known in the art such as for example solid 
phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA sequences encoding the antisense 
RNA molecule. Such DNA sequences may be incorporated into a wide variety of vectors 
that incorporate suitable RNA polymerase promoters such as the T7 or SP6 polymerase 
10 promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA 

constitutively or inducibly, depending on the promoter used, can be introduced stably into 
cell lines. 

5.15 Assays For Therapeutic or Prophylactic Compounds 
1 5 The present invention also provides assays for use in drug discovery in order to 

identify or verify the efficacy of compounds for treatment or prevention of breast cancer. 

Test compounds can be assayed for their ability to restore BF or BPI levels in a subject 
having breast cancer towards levels found in subjects free from breast cancer or to produce 
similar changes in experimental animal models of breast cancer. Compounds able to restore 
20 BF or BPI levels in a subject having breast cancer towards levels found in subjects free from 
breast cancer or to produce similar changes in experimental animal models of breast cancer 
can be used as lead compounds for further drug discovery, or used therapeutically. BF and 
BPI expression can be assayed by the Preferred Technology, immunoassays, gel 
electrophoresis followed by visualization, detection of BPI activity, or any other method 
25 taught herein or known to those skilled in the art. Such assays can be used to screen 

candidate drugs, in clinical monitoring or in drug development, where abundance of an BF 
or BPI can serve as a surrogate marker for clinical disease. 

In various specific embodiments, in vitro assays can be carried out with cells 
representative of cell types involved in a subject's disorder, to determine if a compound has a 
30 desired effect upon such cell types. 

Compounds for use in therapy can be tested in suitable animal model systems prior to 
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testing in humans, including but not limited to rats, mice, chicken, cows, monkeys, rabbits, 
etc. For in vivo testing, prior to administration to humans, any animal model system known 
in the art may be used. Examples of animal models of breast cancer include, but are not 
limited to, xenografts of human breast cancer cell lines such as MDA-MB-435 in estrogen- 
deprived Severe Combined Immunodefiecient (SCID) mice (Eccles et al., 1994 Cell 
Biophysics 24/25, 279). These can be utilized to test compounds that modulate BF or BPI 
levels since the pathology exhibited in these models is similar to that of breast cancer. It is 
also apparent to the skilled artisan that, based upon the present disclosure, transgenic 
animals can be produced with "knock-out" mutations of the gene or genes encoding one or 
more BPIs. A "knock-out" mutation of a gene is a mutation that causes the mutated gene to 
not be expressed, or expressed in an aberrant form or at a low level, such that the activity 
associated with the gene product is nearly or entirely absent. Preferably, the transgenic 
animal is a mammal, more preferably, the transgenic animal is a mouse. 

In one embodiment, test compounds that modulate the expression of a BPI are 
identified in non-human animals (e.g., mice, rats, monkeys, rabbits, and guinea pigs), 
preferably non-human animal models for breast cancer, expressing the BPI. In accordance 
with this embodiment, a test compound or a control compound is administered to the 
animals, and the effect of the test compound on expression of one or more BPIs is 
determined. A test compound that alters the expression of a BPI (or a plurality of BPIs) can 
be identified by comparing the level of the selected BPI or BPIs (or mRNA(s) encoding the 
same) in an animal or group of animals treated with a test compound with the level of the 
BPI(s) or mRNA(s) in an animal or group of animals treated with a control compound. 
Techniques known to those of skill in the art can be used to determine the mRNA and 
protein levels, for example, in situ hybridization. The animals may or may not be sacrificed 
to assay the effects of a test compound. 

In another embodiment, test compounds that modulate the activity of a BPI or a 
biologically active portion thereof are identified in non-human animals (e.g., mice, rats, 
monkeys, rabbits, and guinea pigs), preferably non-human animal models for breast cancer, 
expressing the BPI. In accordance with this embodiment, a test compound or a control 
compound is administered to the animals, and the effect of a test compound on the activity 
of a BPI is determined. A test compound that alters the activity of a BPI (or a plurality of 
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BPIs) can be identified by assaying animals treated with a control compound and animals 
treated with the test compound. The activity of the BPI can be assessed by detecting 
induction of a cellular second messenger of the BPI ( e.g ., intracellular Ca2+, diacylglycerol, 
IP3, etc.), detecting catalytic or enzymatic activity of the BPI or binding partner thereof, 

5 detecting the induction of a reporter gene (e.g . , a regulatory element that is responsive to a 
BPI of the invention operably linked to a nucleic acid encoding a detectable marker, such as 
luciferase or green fluorescent protein), or detecting a cellular response (e.g., cellular 
differentiation or cell proliferation). Techniques known to those of skill in the art can be 
utilized to detect changes in the activity of a BPI (see, e.g., U.S. Patent No. 5,401,639, which 
10 is incorporated herein by reference). 

In yet another embodiment, test compounds that modulate the level or expression of 
a BPI (or plurality of BPIs) are identified in human subjects having breast cancer, preferably 
those having breast cancer and most preferably those having severe breast cancer. In 
accordance with this embodiment, a test compound or a control compound is administered to 
15 the human subject, and the effect of a test compound on BPI expression is determined by 

analyzing the expression of the BPI or the mRNA encoding the same in a biological sample 
(e.g., a breast tissue biopsy or a body fluid such as serum, plasma, or urine). A test 
compound that alters the expression of a BPI can be identified by comparing the level of the 
BPI or mRNA encoding the same in a subject or group of subjects treated with a control 
20 compound to that in a subject or group of subjects treated with a test compound. 

Alternatively, alterations in the expression of a BPI can be identified by comparing the level 
of the BPI or mRNA encoding the same in a subject or group of subjects before and after the 
administration of a test compound. Techniques known to those of skill in the art can be used 
to obtain the biological sample and analyze the mRNA or protein expression. For example, 
25 the Preferred Technology described herein can be used to assess changes in the level of a 
BPI. 

In another embodiment, test compounds that modulate the activity of a BPI (or 
plurality of BPIs) are identified in human subjects having breast cancer, preferably those 
having breast cancer and most preferably those with severe breast cancer. In this 
30 embodiment, a test compound or a control compound is administered to the human subject, 
and the effect of a test compound on the activity of a BPI is determined. A test compound 
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that alters the activity of a BPI can be identified by comparing biological samples from 
subjects treated with a control compound to samples from subjects treated with the test 
compound. Alternatively, alterations in the activity of a BPI can be identified by comparing 
the activity of a BPI in a subject or group of subjects before and after the administration of a 
test compound. The activity of the BPI can be assessed by detecting in a biological sample 
(e.g., a breast tissue biopsy, or a ody fluid such as serum, plasma, or urine) induction of a 
cellular signal transduction pathway of the BPI (e.g., intracellular Ca2+, diacylglycerol, IP3, 
etc.), catalytic or enzymatic activity of the BPI or a binding partner thereof, or a cellular 
response, for example, cellular differentiation, or cell proliferation. Techniques known to 
those of skill in the art can be used to detect changes in the induction of a second messenger 
of a BPI or changes in a cellular response. For example, RT-PCR can be used to detect 
changes in the induction of a cellular second messenger. 

In a preferred embodiment, a test compound that changes the level or expression of a 
BPI towards levels detected in control subjects (e.g., humans free from breast cancer) is 
selected for further testing or therapeutic use. In another preferred embodiment, a test 
compound that changes the activity of a BPI towards the activity found in control subjects 
(e.g., humans free from breast cancer) is selected for further testing or therapeutic use. 

In another embodiment, test compounds that reduce the severity of one or more 
symptoms associated with breast cancer are identified in human subjects having breast 
cancer, preferably subjects having breast cancer and most preferably subjects with severe 
breast cancer. In accordance with this embodiment, a test compound or a control compound 
is administered to the subjects, and the effect of a test compound on one or more symptoms 
of breast cancer is determined. A test compound that reduces one or more symptoms can be 
identified by comparing the subjects treated with a control compound to the subjects treated 
with the test compound. Techniques known to physicians familiar with breast cancer can be 
used to determine whether a test compound reduces one or more symptoms associated with 
breast cancer. For example, a test compound that enhances memoiy or reduces confusion in 
a subject having breast cancer will be beneficial for treating subjects having breast cancer. 

In a preferred embodiment, a test compound that reduces the severity of one or more 
symptoms associated with breast cancer in a human having breast cancer is selected for 
further testing or therapeutic use. 
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5.16. Therapeutic and Prophylactic Compositions and Their Use 
The invention provides methods of treatment (and prophylaxis) comprising 
administering to a subject an effective amount of a compound of the invention. In a 
5 preferred aspect, the compound is substantially purified ( e.g ., substantially free from 

substances that limit its effect or produce undesired side-effects). The subject is preferably 
an animal, including but not limited to animals such as cows, pigs, horses, chickens, cats, 
dogs, etc., and is preferably a mammal, and most preferably human. In a specific 
embodiment, a non-human mammal is the subject. 

1 o Formulations and methods of administration that can be employed when the 

compound comprises a nucleic acid are described above; additional appropriate formulations 
and routes of administration are described below. 

Various delivery systems are known and can be used to administer a compound of 
the invention, e.g., encapsulation in liposomes, microparticles, microcapsules, recombinant 
15 cells capable of expressing the compound, receptor-mediated endocytosis (see, e.g., Wu and 
Wu, 1987, J. Biol. Chem. 262:4429-4432), construction of a nucleic acid as part of a 
retroviral or other vector, etc. Methods of introduction can be enteral or parenteral and 
include but are not limited to intradermal, intramuscular, intraperitoneal, intravenous, 
subcutaneous, intranasal, epidural, and oral routes. The compounds may be administered by 
20 any convenient route, for example by infusion or bolus injection, by absorption through 

epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and intestinal mucosa, etc.) and 
may be administered together with other biologically active agents. Administration can be 
systemic or local. In addition, it may be desirable to introduce the pharmaceutical 
compositions of the invention into the central nervous system by any suitable route, 

25 including intraventricular and intrathecal injection; intraventricular injection may be 

facilitated by an intraventricular catheter, for example, attached to a reservoir, such as an 
Ommaya reservoir. Pulmonary administration can also be employed, e.g., by use of an 
inhaler or nebulizer, and formulation with an aerosolizing agent. 

In a specific embodiment, it may be desirable to administer the pharmaceutical 
30 compositions of the invention locally to the area in need of treatment; this may be achieved, 
for example, and not by way of limitation, by local infusion during surgery, topical 
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application, e.g, in conjunction with a wound dressing after surgery, by injection, by means 
of a catheter, by means of a suppository, or by means of an implant, said implant being of a 
porous, non-porous, or gelatinous material, including membranes, such as sialastic 
membranes, or fibers. In one embodiment, administration can be by direct injection into 
5 serum or at the site (or former site) of a malignant tumour or neoplastic or pre-neoplastic 
tissue. 

In another embodiment, the compound can be delivered in a vesicle, in particular a 
liposome ( see Langer, 1990, Science 249:1527-1533; Treat et al., in Liposomes in the 
Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New 
10 York, pp. 353-365 (1989); Lopez-Berestein, ibid., pp. 317-327; see generally ibid.) 

In yet another embodiment, the compound can be delivered in a controlled release 
system. In one embodiment, a pump may be used (see Langer, supra ; Sefton, 1987, CRC 
Crit. Ref. Biomed. Eng. 14:201; Buchwald et al., 1980, Surgery 88:507; Saudek et al., 1989, 
N. Engl. J. Med. 321:574). In another embodiment, polymeric materials can be used (see 
15 Medical Applications of Controlled Release, Langer and Wise (eds.), CRC Pres., Boca 
Raton, Florida (1974); Controlled Drug Bioavailability, Drug Product Design and 
Performance, Smolen and Ball (eds.), Wiley, New York (1984); Ranger and Peppas, J., 

1983, Macromol. Sci. Rev. Macromol. Chem. 23:61; see also Levy et al., 1985, Science 
228:190; During et al., 1989, Ann. Neurol. 25:351; Howard et al., 1989, J. Neurosurg. 

20 7 1 : 1 05 ). In yet another embodiment, a controlled release system can be placed in proximity, 

of the therapeutic target, i.e., the brain, thus requiring only a fraction of the systemic dose 
(see, e.g., Goodson, in Medical Applications of Controlled Release, supra , vol. 2, pp. 
115-138(1984)). 

Other controlled release systems are discussed in the review by Langer (1990, 

25 Science 249:1527-1533). 

In a specific embodiment where the compound of the invention is a nucleic acid 
encoding a protein, the nucleic acid can be administered in vivo to promote expression of its 
encoded protein, by constructing it as part of an appropriate nucleic acid expression vector 
and administering it so that it becomes intracellular, e.g., by use of a retroviral vector (see 
30 U.S. Patent No. 4,980,286), or by direct injection, or by use of microparticle bombardment 
(e g * a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface receptors or 
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transfecting agents, or by administering it in linkage to a homeobox-like peptide which is 
known to enter the nucleus (see e.g ., Joliot et al., 1991, Proc. Natl. Acad. Sci. USA 
88:1864-1868), etc. Alternatively, a nucleic acid can be introduced intracellularly and 
incorporated within host cell DNA for expression, by homologous recombination. 

5 The present invention also provides pharmaceutical compositions. Such 

compositions comprise a therapeutically effective amount of a compound, and a 
pharmaceutically acceptable carrier. In a specific embodiment, the term "pharmaceutically 
acceptable" means approved by a regulatory agency of the Federal or a state government or 
listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use in 
10 animals, and more particularly in humans. The term "earner" refers to a diluent, adjuvant, 
excipient, or vehicle with which the therapeutic is administered. Such pharmaceutical 
carriers can be sterile liquids, such as water and oils, including those of petroleum, animal, 
vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the 
like. Water is a preferred carrier when the pharmaceutical composition is administered 
15 intravenously. Saline solutions and aqueous dextrose and glycerol solutions can also be 
employed as liquid carriers, particularly for injectable solutions. Suitable pharmaceutical 
excipients include starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, 
sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, glycerol, 
propylene, glycol, water, ethanol and the like. The composition, if desired, can also contain 
20 minor amounts of wetting or emulsifying agents, or pH buffering agents. These 

compositions can take the form of solutions, suspensions, emulsion, tablets, pills, capsules, 
powders, sustained-release formulations and the like. The composition can be formulated as 
a suppository, with traditional binders and carriers such as triglycerides. Oral formulation 
can include standard carriers such as pharmaceutical grades of mannitol, lactose, starch, 

25 magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, etc. Examples of 
suitable pharmaceutical carriers are described in "Remington's Pharmaceutical Sciences by 
E.W. Martin. Such compositions will contain a therapeutically effective amount of the 
compound, preferably in purified form, together with a suitable amount of carrier so as to 
provide the form for proper administration to the subject. The formulation should suit the 
30 mode of administration. 

In a preferred embodiment, the composition is formulated in accordance with routine 
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procedures as a pharmaceutical composition adapted for intravenous administration to 
human beings. Typically, compositions for intravenous administration are solutions in 
sterile isotonic aqueous buffer. Where necessary, the composition may also include a 
solubilizing agent and a local anesthetic such as lidocaine to ease pain at the site of the 
injection. Generally, the ingredients are supplied either separately or mixed together in unit 
dosage form, for example, as a dry lyophilized powder or water free concentrate in a 
hermetically sealed container such as an ampoule or sachette indicating the quantity of 
active agent. Where the composition is to be administered by infusion, it can be dispensed 
with an infusion bottle containing sterile pharmaceutical grade water or saline. Where the 
composition is administered by injection, an ampoule of sterile water for injection or saline 
can be provided so that the ingredients may be mixed prior to administration. 

The compounds of the invention can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with free amino groups such as those 
derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and those formed 
with free carboxyl groups such as those derived from sodium, potassium, ammonium, 
calcium, ferric hydroxides, isopropylamine, triethylamine, 2-ethylamino ethanol, histidine, 
procaine, etc. 

The amount of the compound of the invention which will be effective in the 
treatment of breast cancer can be determined by standard clinical techniques. In addition, in 
vitro assays may optionally be employed to help identify optimal dosage ranges. The 
precise dose to be employed in the formulation will also depend on the route of 
administration, and the seriousness of the disease or disorder, and should be decided 
according to the judgment of the practitioner and each subject's circumstances. However, 
suitable dosage ranges for intravenous administration are generally about 20-500 
micrograms of active compound per kilogram body weight. Suitable dosage ranges for 
intranasal administration are generally about 0.01 pg/kg body weight to 1 mg/kg body 
weight. Effective doses may be extrapolated from dose-response curves derived from in 
vitro or animal model test systems. 

Suppositories generally contain active ingredient in the range of 0.5% to 10% by 
weight; oral formulations preferably contain 10% to 95% active ingredient. 

The invention also provides a pharmaceutical pack or kit comprising one or more 
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containers filled with one or more of the ingredients of the pharmaceutical compositions of 
the invention. Optionally associated with such containers) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects (a) approval by the agency of 
5 manufacture, use or sale for human administration, (b) directions for use, or both. 

6. EXAMPLE: IDENTIFICATION OF PROTEINS DIFFERENTIALLY EXPRESSED 
IN THE SERUM OF BREAST CANCER PATIENTS 
10 Using the following procedure, proteins in serum samples from (a) 15 patients 

having primary breast cancer, (b) 17 patients having metastatic breast cancer, and (c) 13 
unrelated control samples taken from subjects unaffected by breast cancer, were separated by 
isoelectric focusing followed by SDS-PAGE and analysed. Parts 6.1.1 to 6.1.19 (inclusive) 
of the procedure set forth below are hereby designated as the "Reference Protocol". 

15 

1. MATERIALS AND METHODS 

6.1.1 Sample Preparation 

A protein assay (Pierce BCA Cat # 23225) was performed on each serum sample as 
received. Prior to protein separation, each sample was processed for selective depletion of 
20 certain proteins, in order to enhance and simplify protein separation and facilitate analysis 
by removing proteins that may interfere with or limit analysis of proteins of interest. See 
International Patent Application No. PCT/GB99/01742, filed June 1, 1999, which is 
incorporated by reference in its entirety, with particular reference to pages 3 and 6. 

Removal of albumin, haptoglobin, transferrin and immunoglobin G (IgG) from 
25 serum ("serum depletion") was achieved by an affinity chromatography purification step in 
which the sample was passed through a series of ‘Hi-Trap* columns containing immobilized 
antibodies for selective removal of albumin, haptoglobin and transferrin, and protein G for 
selective removal of immunoglobin G. Two affinity columns in a tandem assembly were 
prepared by coupling antibodies to protein G-sepharose contained in Hi-Trap columns 
30 (Protein G-Sepharose Hi-Trap columns (1 ml) Pharmacia Cat. No. 17-0404-01). This was 
done by circulating the following solutions sequentially through the columns: (1) 
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Dulbecco’s Phosphate Buffered Saline (Gibco BRL Cat. No. 14190-094); (2) concentrated 
antibody solution; (3) 200 mM sodium carbonate buffer, pH 8.35; (4) cross-linking solution 
(200 mM sodium carbonate buffer, pH 8.35, 20 mM dimethylpimelimidate); and (5) 500 
mM ethanolamine, 500 mM NaCI. A third (un-derivatised) protein G Hi-Trap column was 
then attached to the lower end of the tandem column assembly. 

The chromatographic procedure was automated using an Akta Fast Protein Liquid 
Chromatography (FPLC) System such that a series of up to seven runs could be performed 
sequentially. The samples were passed through the series of 3 Hi-Trap columns in which the 
affinity chromatography media selectively bind the above proteins thereby removing them 
from the sample. Fractions (typically 3 ml per tube) were collected of unbound material 
( Flowthrough fractions") that eluted through the column during column loading and 
washing stages and of bound proteins ("Bound/Eluted fractions") that were eluted by step 
elution with Immunopure Gentle Ag/Ab Elution Buffer (Pierce Cat. No. 21013). The eluate 
containing unbound material was collected in fractions which were pooled, 
desalted/concentrated by centrifugal ultrafiltration and stored to await further analysis by 2D 
PAGE. 

A volume of depleted serum containing approximately 300 pg of total protein was 
aliquoted and an equal volume of 10% (w/v) SDS (Fluka 71729), 2.3% (w/v) dithiothreitol 
(BDH 443852A) was added. The sample was heated at 95°C for 5 mins, and then allowed to 
cool to 20°C . 125pl of the following buffer was then added to the sample: 

8M urea (BDH 452043w ) 

4% CHAPS (Sigma C3023) 

65mM dithiotheitol (DTT) 

2% (v/v) Resolytes 3.5-10 (BDH 44338 2x) 

This mixture was vortexed, and centrifuged at 13000 rpm for 5 mins at 15°C, and the 
supernatant was analyzed by isoelectric focusing. 
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6.1.2 Isoelectric Focusing 

Isoelectric focusing (IEF), was performed using the Immobiline® DxyStrip Kit 
(Pharmacia BioTech), following the procedure described in the manufacturer’s instructions, 
see Instructions for Immobiline® DryStrip Kit, Pharmacia, # 18-1038-63, Edition AB 
5 (incorporated herein by reference in its entirety). Immobilized pH Gradient (IPG) strips 

(18cm, pH 3-10 non-linear strips; Pharmacia Cat. # 17-1235-01) were rehydrated overnight 
at 20°C in a solution of 8M urea, 2% (w/v) CHAPS, lOmM DTT, 2% (v/v) Resolytes 3.5- 
10, as described in the Immobiline DryStrip Users Manual. For IEF, 50 pi of supernatant 
(prepared as above) was loaded onto a strip, with the cup-loading units being placed at the 
10 basic end of the strip. The loaded gels were then covered with mineral oil (Pharmacia 17- 
3335-01) and a voltage was immediately applied to the strips according to the following 
profile, using a Pharmacia EPS3500XL power supply (Cat 19-3500-01): 

Initial voltage = 300V for 2 hrs 
1 5 Linear Ramp from 300V to 3500V over 3hrs 

Hold at 3500V for 19hrs 

For all stages of the process, the current limit was set to 10mA for 12 gels, and the 
wattage limit to 5W. The temperature was held at 20°C throughout the run. 

20 

6.1.3 Gel Equilibration and SDS-PAGE 

After the final 19hr step, the strips were immediately removed and immersed for 10 
mins at 20°C in a first solution of the following composition: 6M urea; 2% (w/v) DTT; 2% 
(w/v) SDS; 30% (v/v) glycerol (Fluka 49767); 0.05M Tris/HCl, pH 6.8 (Sigma Cat T-1503). 
25 The strips were removed from the first solution and immersed for 10 mins at 20°C in a 

second solution of the following composition: 6M urea; 2% (w/v) iodoacetamide (Sigma I- 
6125); 2% (w/v) SDS; 30% (v/v) glycerol; 0.05M Tris/HCl, pH 6.8. After removal from the 
second solution, the strips were loaded onto supported gels for SDS-PAGE according to 
Hochstrasser et al., 1988, Analytical Biochemistry 173: 412-423 (incorporated herein by 
30 reference in its entirety), with modifications as specified below. 
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6- 1 .4 Preparation of supported gels The gels were cast between two glass plates of 
the following dimensions: 23cm wide x 24cm long (back plate); 23cm wide x 24cm long 
with a 2cm deep notch in the central 19cm (front plate). To promote covalent attachment of 
SDS-PAGE gels, the back plate was treated with a 0.4% solution of y-methacryl- 
oxypropyltrimethoxysilane in ethanol (BindSilane™; Pharmacia Cat. # 17-1330-01). The 
front plate was treated with (RepelSilane™ Pharmacia Cat. # 17-1332-01) to reduce 
adhesion of the gel. Excess reagent was removed by washing with water, and the plates 
were allowed to dry. At this stage, both as identification for the gel, and as a marker to 
identify the coated face of the plate, an adhesive bar-code was attached to the back plate in a 
position such that it would not come into contact with the gel matrix. 

The dried plates were assembled into a casting box with a capacity of 13 gel 
sandwiches. The top and bottom plates of each sandwich were spaced by means of 1mm 
thick spacers, 2.5 cm wide. The sandwiches were interleaved with acetate sheets to facilitate 
separation of the sandwiches after gel polymerization. Casting was then carried out 
according to Hochstrasser et al., op . cit. 

A 9-16% linear polyacrylamide gradient was cast, extending up to a point 2cm below 
the level of the notch in the front plate, using the Angelique gradient casting system (Large 
Scale Biology). Stock solutions were as follows. Acrylamide (40% in water) was from 
Serva (Cat. # 10677). The cross-linking agent was PDA (BioRad 161-0202), at a 
concentration of 2.6% (w/w) of the total starting monomer content. The gel buffer was 
0.375M Tris/HCl, pH 8.8. The polymerization catalyst was 0.05% (v/v) TEMED (BioRad 
161-0801), and the initiator was 0.1% (w/v) APS (BioRad 161-0700). NoSDSwas 
included in the gel and no stacking gel was used. The cast gels were allowed to polymerize 
at 20°C overnight, and then stored at 4°C in sealed polyethylene bags with 6ml of gel buffer, 
and were used within 4 weeks. 

6.1.5 SDS-PAGE 

A solution of 0.5% (w/v) agarose (Fluka Cat 05075) was prepared in running buffer 
(0.025M Tris, 0.1 98M glycine (Fluka 50050), 1% (w/v) SDS, supplemented by a trace of 
bromophenol blue). The agarose suspension was heated to 70°C with stirring, until the 
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agarose had dissolved. The top of the supported 2 nd D gel was filled with the agarose 
solution, and the equilibrated strip was placed into the agarose, and tapped gently with a 
palette knife until the gel was intimately in contact with the 2 nd D gel. The gels were placed 
in the 2 nd D running tank, as described by Amess et al., 1995, Electrophoresis 16: 1255-1267 
5 (incorporated herein by reference in its entirety). The tank was filled with running buffer (as 
above) until the level of the buffer was just higher than the top of the region of the 2 nd D gels 
which contained polyacrylamide, so as to achieve efficient cooling of the active gel area. 
Running buffer was added to the top buffer compartments formed by the gels, and then 
voltage was applied immediately to the gels using a Consort E-833 power supply. For 1 
10 hour, the gels were run at 20mA/gel. The wattage limit was set to 150W for a tank 

containing 6 gels, and the voltage limit was set to 600V. After 1 hour, the gels were then 
run at 40mA/gel, with the same voltage and wattage limits as before, until the bromophenol 
blue line was 0.5cm from the bottom of the gel. The temperature of the buffer was held at 
16°C throughout the run. Gels were not run in duplicate. 

15 

6.1.6 Staining 

Upon completion of the electrophoresis run, the gels were immediately removed 
from the tank for fixation. The top plate of the gel cassette was carefully removed, leaving 
the gel bonded to the bottom plate. The bottom plate with its attached gel was then placed 
20 into a staining apparatus, which can accommodate 12 gels. The gels were completely 
immersed in fixative solution of 40% (v/v) ethanol (BDH 28719), 10% (v/v) acetic acid 
(BDH 100016X), 50% (v/v) water (MilliQ-Millipore), which was continuously circulated 
over the gels. After an overnight incubation, the fixative was drained from the tank, and the 
gels were primed by immersion in 7.5% (v/v) acetic acid, 0.05% (w/v) SDS, 92.5% (v/v) 

25 water for 30 mins. The priming solution was then drained, and the gels were stained by 

complete immersion for 4 hours in a staining solution of Sypro Red (Molecular Probes, Inc., 
Eugene, Oregon). Alternative dyes which can be used for this purpose ared described in 
USSn 09/412, 168, filed October 5 1999, and incorporated herein by reference in its entirety. 

30 6.1.7 Imaging of the gel 

A computer-readable output was produced by imaging the fluorescently stained gels 
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with the Apollo 2 scanner (Oxford Glycosciences, Oxford, UK) described in section 5.1, 
supra. This scanner has a gel carrier with four integral fluorescent markers (Designated Ml, 
M2, M3, M4) that are used to correct the image geometry and are a quality control feature to 
confirm that the scanning has been performed correctly. 

For scanning, the gels were removed from the stain, rinsed with water and allowed to 
air dry briefly, and imaged on the Apollo 2. After imaging, the gels were sealed in 
polyethylene bags containing a small volume of staining solution, and then stored at 4°C. 

6.1.8 Digital Analysis of the Data 

The data were processed as described in U.S. Application Serial No. 08/980,574, 
(published as WO 98/23950) at Sections 5.4 and 5.5 (incoiporated herein by reference), as 
set forth more particularly below. 

The output from the scanner was first processed using the MELANIE® II 2D PAGE 
analysis program (Release 2.2, 1997, BioRad Laboratories, Hercules, California, Cat. # 170- 
7566) to autodetect the registration points, Ml, M2, M3 and M4; to autocrop the images 
(i.e., to eliminate signals originating from areas of the scanned image lying outside the 
boundaries of the gel, e.g. the reference frame); to filter out artifacts due to dust; to detect 
and quantify features; and to create image files in GIF format. Features were detected using 
the following parameters: 

Smooths =2 
Laplacian threshold 50 
Partials threshold 1 
Saturation = 100 
Peakedness = 0 
Minimum Perimeter = 10 
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6.1.9 Assignment of pi and MW Values 

Landmark identification was used to determine the pi and MW of features detected in 
the images. Eleven landmark features, designated DS1, DS2, DS4, DS5, DS6, DS8, DS9, 
DS10, DS11, DS12, and DS13 were identified in a standard serum image. These landmark 
5 features are identified in Figure 1 and were assigned the pi and MW values identified in 
Table Xin. 


Table XUI. Landmark Features Used in this Study 


Name 

PI 

MW (Da) 

Name 

pi 

MW (Da) 

DS1 

5.55 

185070 

DS9 

5.22 

23000 

DS2 

6.20 

100000 

DS10 

5.52 

13800 

DS4 

5.15 

73470 

DS11 

6.65 

56170 

DS5 

4.10 

44160 

DS12 

9.01 

12060 

DS6 

6.98 

31720 

DS13 

4.75 

41230 

DS8 

4.47 

23920 





As many of these landmarks as possible were identified in each gel image of the data 
10 set. Each feature in the study gels was then assigned a pi value by linear interpolation or 
extrapolation (using the MELANIE®-II software) to the two nearest landmarks, and was 
assigned a MW value by linear interpolation or extrapolation (using the MELAME®-II 
software) to the two nearest landmarks. 

6.1.10 Matching With Primary Master Image 

15 Images were edited to remove gross artifacts such as dust, to reject images which had 

gross abnormalities such as smearing of protein features, or were of too low a loading or 
overall image intensity to allow identification of more than the most intense features, or 
were of too poor a resolution to allow accurate detection of features. Images were then 
compared by pairing with one common image from the whole sample set. This common 
20 image, the "primary master image", was selected on the basis of protein load (maximum 
load consistent with maximum feature detection), a well resolved myoglobin region, 
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(myoglobin was used as an internal standard), and general image quality. Additionally, the 
primary master image was chosen to be an image which appeared to be generally 
representative of all those to be included in the analysis. (This process by which a primary 
master gel was judged to be representative of the study gels was rechecked by the method 
5 described below and in the event that the primary master gel was seen to be 

unrepresentative, it was rejected and the process repeated until a representative primary 
master gel was found.) 

Each of the remaining study gel images was individually matched to the primary 
master image such that common protein features were paired between the primary master 
10 image and each individual study gel image as described below. 

6.1.11 Cross-matching Between Samples 

To facilitate statistical analysis of large numbers of samples for purposes of 
identifying features that are differentially expressed, the geometry of each study gel was 
adjusted for maximum alignment between its pattern of protein features, and that of the 
15 primary master, as follows. Each of the study gel images was individually transformed into 
the geometry of the primary master image using a multi-resolution warping procedure. This 
procedure corrects the image geometry for the distortions brought about by small changes in 
the physical parameters of the electrophoresis separation process from one sample to 
another. The observed changes are such that the distortions found are not simple geometric 
20 distortions, but rather a smooth flow, with variations at both local and global scale. 

The fundamental principle in multi-resolution modeling is that smooth signals may 
be modeled as an evolution through 'scale space’, in which details at successively finer scales 
are added to a low resolution approximation to obtain the high resolution signal. This type of 
model is applied to the flow field of vectors (defined at each pixel position on the reference 
25 image) and allows flows of arbitrary smoothness to be modeled with relatively few degrees 
of freedom. Each image is first reduced to a stack, or pyramid, of images derived from the 
initial image, but smoothed and reduced in resolution by a factor of 2 in each direction at 
every level (Gaussian pyramid) and a corresponding difference image is also computed at 
each level, representing the difference between the smoothed image and its progenitor 
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(Laplacian pyramid). Thus the Laplacian images represent the details in the image at 
different scales. 

To estimate the distortion between any 2 given images, a calculation was performed 
at level 7 in the pyramid (i.e. after 7 successive reductions in resolution). The Laplacian 
5 images were segmented into a grid of 16x16 pixels, with 50% overlap between adjacent grid 
positions in both directions, and the cross correlation between corresponding grid squares on 
the reference and the test images was computed. The distortion displacement was then given 
by the location of the maximum in the correlation matrix. After all displacements had been 
calculated at a particular level, they were interpolated to the next level in the pyramid, 

10 applied to the test image, and then further corrections to the displacements were calculated at 
the next scale. 

The warping process brought about good alignment between the common features in 
the primary master image, and the images for the other samples. The MELANIE® II 2D 
PAGE analysis program was used to calculate and record approximately 500-700 matched 
1 5 feature pairs between the primary master and each of the other images. The accuracy of this 
program was significantly enhanced by the alignment of the images in the manner described 
above. To improve accuracy still further, all pairings were finally examined by eye in the 
MelView interactive editing program and residual recognizably incorrect pairings were 
removed. Where the number of such recognizably incorrect pairings exceeded the overall 
20 reproducibility of the Preferred Technology (as measured by repeat analysis of the same 
biological sample) the gel selected to be the primary master gel was judged to be 
insufficiently representative of the study gels to serve as a primary master gel. In that case, 
the gel chosen as the primary master gel was rejected, and different gel was selected as the 
primary master gel, and the process was repeated. 

25 All the images were then added together to create a composite master image, and the 

positions and shapes of all the gel features of all the component images were super-imposed 
onto this composite master as described below. 

Once all the initial pairs had been computed, corrected and saved, a second pass was 
performed whereby the original (unwaiped) images were transformed a second time to the 
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geometry of the primary master, this time using a flow field computed by smooth 
interpolation of the multiple tie-points defined by the centroids of the paired gel features. A 
composite master image was thus generated by initialising the primary master image with its 
feature descriptors. As each image was transformed into the primary master geometry, it was 
5 digitally summed pixel by pixel into the composite master image, and the features that had 
not been paired by the procedure outlined above were likewise added to the composite 
master image description, with their centroids adjusted to the master geometry using the 
flow field correction. 

The final stage of processing was applied to the composite master image and its 
10 feature descriptors, which now represent all the features from all the images in the study 

transformed to a common geometry. The features were grouped together into linked sets or 
"clusters", according to the degree of overlap between them. Each cluster was then given a 
unique identifying index, the molecular cluster index (MCI). 

An MCI identifies a set of matched features on different images. Thus an MCI 
1 5 represents a protein or proteins eluting at equivalent positions in the 2D separation in 

different samples. 

6.1.12. Construction of Profiles 

After matching all component gels in the study to the final composite master image, 
the intensity of each feature was measured and stored. The end result of this analysis was 
20 the generation of a digital profile which contained, for each identified feature: 1) a unique 

identification code relative to corresponding feature within the composite master image 
(MCI), 2) the x, y coordinates of the features within the gel, 3) the isoelectric point (pi) of 
the BFs, 4) the apparent molecular weight (MW) of the BFs, 5) the signal value, 6) the 
standard deviation for each of the preceding measurements, and 7) a method of linking the 
25 MCI of each feature to the master gel to which this feature was matched. By virtue of a 

Laboratory Information Management System (LIMS), this MCI profile was traceable to the 
actual stored gel from which it was generated, so that proteins identified by computer 
analysis of gel profile databases could be retrieved. The LIMS also permitted the profile Xo v 
be traced back to an original sample or patient. 
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6.1.13. Statistical Analysis of the Profiles 

The complementary statistical strategies specified below were used to identify BFs 
from the MCIs within the mastergroup. However, the skilled artisan would be able to select 
additional statistical methods for use and the current invention is not intended to be limited 
5 to these methods of analysis. 

A fold change representing the ratio of the averages of each of the BFs within an 
MCI was calculated for each MCI between each set of controls and breast cancer samples. A 
95% confidence limit for the mean of the fold changes was calculated. The MCIs with fold 
changes which fall either above or below the confidence limit were selected as BFs which 
10 met the criteria of the significant fold change threshold with 95% selectivity. Because the 
MCI fold changes are based on a 95% confidence limit, it follows that the significant fold 
change threshold is itself 95%. 

A second non-overlapping strategy is based on the use of the Wilcoxon Rank-Sum 
test. This test was performed between the control and the breast cancer samples for each 
15 MCI basis. The MCIs which recorded a p-value less than or equal to 0.05 were selected as 
statistically significant BFs with 95% selectivity. 

A third non-overlapping selection strategy is based on qualitative presence or 
absence alone. Using this procedure, a percentage feature presence was calculated across the 
control samples and breast cancer samples for each MCI which was a potential BF based on 
20 such qualitative criteria alone i.e. presence or absence. The MCIs which recorded a 

percentage feature presence of 95% or more on breast cancer samples and a percentage 
feature presence of 5% or less on control samples, were selected as the qualitative 
differential BFs with 95% selectivity. A second group of qualitative differential BFs with 
95% selectivity were formed by those MCIs which recorded a percentage feature presence of 
25 95% or more on control samples and a percentage feature presence of 5% or less on breast 

cancer serum samples. 

Without limitation, application of any or more than one of these three analysis 
strategies allowed BFs to be selected on the basis of (a) a significant fold change threshold 
with a chosen selectivity, or (b) statistical significance as measured by he Wilcoxon Rank- 
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Sum test, or (c) qualitative differences with a chosen selectivity. 

The ERFs were present in all serum samples and the coefficient of variation was less 
than 1 0% across all samples. 

6.1.14 Recovery and analysis of selected proteins 

5 Proteins in BFs were robotically excised and processed to generate tryptic peptides; 

partial amino acid sequences of these peptides were' determined by mass spectroscopy, using 
techniques known to those skilled in the art as well as de novo sequencing, as described in 
Application No. 08/877,605, filed June 18, 1997 (published as W098/53323) and 
Application No. 09/094,996, filed June 15, 1998, each of which is incorporated herein by 
10 reference in its entirety. 

6.2 RESULTS 

These initial experiments identified : 13 features that were decreased and 7 features 
that were increased in serum from 15 primary breast cancer patients as compared with serum 
from 13 patients unaffected by breast cancer; 15 features that were decreased and 7 features 
15 that were increased in the serum from 17 metastatic breast cancer patients as compared with 
serum from 1 3 patients unaffected by breast cancer. Details of these BFs are provided in 
Tables I, U, IH and IV. Each BF was differentially present in breast cancer serum as 
compared with normal serum (p<0.05). For some preferred BFs (BF-3, BF-13, BF-19, BF- 
22, BF-26, BF-28, BF-40) the difference was highly significant (p < 0.01). 

20 Partial amino acid sequences were determined for the differentially present BPIs in 

these BFs. Details of these BPIs are provided in Tables VI, VII, VIII and IX. Computer 
searches of public databases identified at least 16 BPIs for which neither the partial amino 
acid sequence (comprising the core sequence and N-terminal and C-terminal masses as 
described in Table XU), nor any oligonucleotide encoding such a partial amino acid 
25 sequence, was described in any public database examined. 

The present invention is not to be limited in terms of the particular embodiments 
described in this application, which are intended as single illustrations of individual aspects 
of the invention. Functionally equivalent methods and apparatus within the scope of the 


114 



WO 01/13117 


PCT/GB00/O3 143 


invention, in addition to those enumerated herein, will be apparent to those skilled in the art 
from the foregoing description and accompanying drawings. Such modifications and 
variations are intended to fall within the scope of the appended claims. The contents of each 
reference, patent and patent application cited in this application is hereby incorporated by 
5 reference in its entirety. 
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CLAIMS: 

A method for screening, diagnosis or prognosis of breast cancer in a subject, for 
determining the stage or severity of breast cancer in a subject, for identifying a 
subject at risk of developing breast cancer, or for monitoring the effect of therapy 
administered to a subject having breast cancer, said method comprising: 

(a) analyzing a test sample of body fluid from the subject by two dimensional 
electrophoresis to generate a two-dimensional array of features, said array 
comprising at least one chosen feature whose relative abundance correlates with 
the presence, absence, stage or severity of breast cancer or predicts the onset or 
course of breast cancer; and 

(b) comparing the abundance of each chosen feature in the test sample with the 
abundance of that chosen feature in body fluid from one or more persons free 
from breast cancer, or with a previously determined reference range for that 
feature in subjects free from breast cancer, or with the abundance at least one 
Expression Reference Feature (ERF) in the test sample. 


The method of claim 1 , wherein the body fluid is serum or plasma. 


The method of claim 1 or claim 2, wherein said method is for screening or diagnosis 
of breast cancer and the relative abundance of at least one chosen feature correlates 
with the presence or absence of breast cancer. 


The method of claim 1 or claim 2, wherein said method is for monitoring the effect 
of therapy administered to a subject having breast cancer and the relative abundance 
of at least one chosen feature correlates with the severity of breast cancer. 


The method of claim 2, wherein step (b) comprises comparing the abundance of each 
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chosen feature in the sample with the abundance of that chosen feature in serum from 
one or more persons free from breast cancer or with a previously determined 
reference range for that chosen feature in subjects free from breast cancer. 

5 6. The method of claim 1 or claim 2, wherein step (b) comprises quantitatively 

detecting one or more of the following Breast Cancer- Associated Features (BFs): BF- 
1, BF-2, BF-3, BF-4, BF-5, BF-7, BF-8, BF-9, BF-10, BF-12, BF-13, BF-14, BF-15, 
BF-16, BF-17, BF-18, BF-19, BF-20, BF-22, BF-23, BF-26, BF-27, BF-28, BF-29, 
BF-30, BF-31, BF-32, BF-33, BF-34, BF-35, BF-36, BF-37, BF-38, BF-39, BF-40, 
10 BF-41, BF-42, BF-43, BF-44, BF-45, BF-46, BF-47, BF-48. 

7. The method according to any one of claims 1 to 6, wherein step (a) comprises 
isoelectric focussing followed by sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE). 

15 

8. A method for screening, diagnosis or prognosis of breast cancer in a subject, for 
determining the stage or severity of breast cancer in a subject, for identifying a 
subject at risk of developing breast cancer, or for monitoring the effect of therapy 
administered to a subject having breast cancer, said method comprising 

20 quantitatively detecting, in a sample of serum or plasma from the subject, at least one 

of the following Breast Cancer-Associated Protein Iso forms (BPIs): BPI-1, BPI-5, 
BPI-6, BPI-9, BPI-1 0, BPI-1 1, BPI-1 2, BPI-13, BPI-14, BPI-1 9, BPI-20, BPI-21, 
BPI-23, BPI-24, BPI-25, BPI-27, BPI-28, BPI-29, BPI-31, BPI-32, BPI-33, BPI-34, 
BPI-37, BPI-40, BPI-41, BPI-42, BPI-43, BPI-44, BPI-45, BPI-46, BPI-47, BPI-48, 
25 BPI-49, BPI-50, BPI-5 1 , BPI-52, BPI-53, BPI-54, BPI-55, BPI-56. 

9. The method according to claim 8, wherein the step of quantitatively detecting 
comprises testing at least one aliquot of the sample, said step of testing comprising: 
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(a) contacting the aliquot with an antibody that is immunospecific for a 
preselected BPI; and 

(b) quantitatively measuring any binding that has occurred between the antibody 
and at least one species in the aliquot. 


10. The method according to claim 9, wherein the step of quantitatively detecting 
comprises testing a plurality of aliquots with a plurality of antibodies for quantitative 
detection of a plurality of preselected BPIs. 

1 1 . The method according to claim 9 or claim 10, wherein the antibody or antibodies 
is/are monoclonal antibodies. 

12. A preparation comprising one of the following isolated Breast Cancer- Associated 
Protein Isoforms (BPIs): BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, BPI-11, BPI-12, BPI- 
13, BPI- 14, BPI- 19, BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, BPI-27, BPI-28, BPI- 
29, BPI-31, BPI-32, BPI-33, BPI-34, BPI-37, BPI-40, BPI-41, BPI-42, BPI-43, BPI- 
44, BPI-45, B PI-46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-5 1, BPI-52, BPI-53, BPI- 
54, BPI-55, BPI-56. 

13. A kit comprising at least one preparation as claimed in claim 12. 

14. A preparation comprising an isolated human protein, said protein comprising a 
peptide having the following sequence : ECQ. 

1 5. The preparation according to claim 14 wherein the protein has an isoelectric point 
(pi) of about 4.31 and an apparent molecular weight (MW) of about 27930. 
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16. The preparation according to claim 15, wherein the pi is within 10% of 4.31 and the 
MW is within 10% of 27930. 

17. The preparation according to claim 15, wherein the pi is within 5% of 4.31 and the 

5 MW is within 5% of 27930. 

1 8 . The preparation according to claim 1 5, wherein the pi is within 1 % of 4.3 1 and the 
MW is within 1% of 27930. 

i 

10 19. A preparation comprising an isolated human protein, said protein comprising a 

peptide having the following sequence: CQATGFSPR or CQATGMSPR. 

20. The preparation according to claim 19 wherein the protein has an isoelectric point 
(pi) of about 6.44 and an apparent molecular weight (MW) of about 44960. 

15 

21. The preparation according to claim 20, wherein the pi is within 10% of 6.44 and the 
MW is within 1 0% of 44960. 

22. The preparation according to claim 20, wherein the pi is within 5% of 6.44 and the 

20 MW is within 5% of 44960. 

23. The preparation according to claim 20, wherein the pi is within 1% of 6.44 and the 
MW is within 1% of 44960. 

25 24. A preparation comprising an isolated human protein, said protein comprising a 
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peptide having the following sequence: DDF or DDM. 


25. A preparation comprising an isolated human protein, said protein comprising a 
peptide having the following sequence: LEFFPR, IEFFPR, LEMFPR, LEFMPR, 
LEMMPR, IEMFPR, LEFMPR, or IEMMPR. 

26. The preparation according to claim 25 wherein the isolated protein further comprises 
a peptide having the following sequence: LEFFPR, IEFFPR, LEMFPR, LEFMPR, 
LEMMPR, IEMFPR, IEFMPR, or IEMMPR. 

27. The preparation according to any one of claims 24, 25 or 26 wherein the protein has 
an isoelectric point (pi) of about 5.34 and an apparent molecular weight (MW) of 
about 16620. 

28. The preparation according to claim 27, wherein the pi is within 1 0% of 5.34 and the 
MW is within 10% of 16620. 


The preparation according to claim 27, wherein the pi is within 5% of 5.34 and the 
MW is within 5% of 16620. 


30. The preparation according to claim 27, wherein the pi is within 1% of 5.34 and the 
MW is within 1% of 16620. 

31. A preparation comprising an isolated human protein, said protein comprising a 
peptide having the following sequence: AN or AGG. 
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32. The preparation according to claim 3 1 wherein the protein has an isoelectric point 
(pi) of about 6.08 and an apparent molecular weight (MW) of about 59520. 

33. The preparation according to claim 32, wherein the pi is within 10% of 6.08 and the 

5 MW is within 10% of 59520. 

34. The preparation according to claim 32, wherein the pi is within 5% of 6.08 and the 
MW is within 5% of 59520. 

10 35. The preparation according to claim 32, wherein the pi is within 1 % of 6.08 and the 

MW is within 1% of 59520. 

36. A preparation comprising an isolated human protein, said protein comprising one or 
more peptides having the following sequence: VYQ, LLEN, LIEN, LLEN, HEN, 

1 5 LLEGG, LI EGG, ILEGG, HEGG, or PA. 

37. The preparation according to claim 36 wherein the protein has an isoelectric point 
(pi) of about 6.02 and an apparent molecular weight (MW) of about 59410. 

20 38. The preparation according to claim 37, wherein the pi is within 10% of 6.02 and the 

MW is within 10% of 59410. 

39. The preparation according to claim 37, wherein the pi is within 5% of 6.02 and the 
MW is within 5% of 59410. 

25 
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The preparation according to claim 37, wherein the pi is within 1% of 6.02 and the 
MW is within 1% of 59410. 


A preparation comprising an isolated human protein, said protein comprising a 
peptide having the following sequence: CYCQK. 


The preparation according to claim 41 wherein the protein has an isoelectric point 
(pi) of about 7.27 and an apparent molecular weight (MW) of about 30450. 


The preparation according to claim 42, wherein the pi is within 10% of 7.27 and the 
MW is within 10% of 30450. 


The preparation according to claim 42, wherein the pi is within 5% of 7.27 and the 
MW is within 5% of 30450. 


The preparation according to claim 42, wherein the pi is within 1% of 7.27 and the 
MW is within 1% of 30450. 


A preparation comprising an isolated human protein, said protein comprising one or 
more peptides having the following sequence: LDDYLN, LDDYIN, IDDYLN, 
IDDYIN, LDDYLGG, LDDYIGG, IDDYLGG, IDDYIGG, HAQ, EL or EL 


The preparation according to claim 46 wherein the protein has an isoelectric point 
(pi) of about 5.97 and an apparent molecular weight (MW) of about 91410. 
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48. The preparation according to claim 47, wherein the pi is within 10% of 5.97 and the 
MW is within 10% of 91410. 

49. The preparation according to claim 47, wherein the pi is within 5% of 5.97 and the 

5 MW is within 5% of 91410. 

50. The preparation according to claim 47, wherein the pi is within 1% of 5.97 and the 
MW is within 1% of 91410. 

10 51. A preparation comprising an isolated human protein, said protein comprising a 

peptide having the following sequence: FGPVPR or MGPVPR. 

52. The preparation according to claim 51 wherein the protein has an isoelectric point 
(pi) of about 4.79 and an apparent molecular weight (MW) of about 47130. 

15 

53. The preparation according to claim 52, wherein the pi is within 10% of 4.79 and the 
MW is within 10% of 47130. 

54. The preparation according to claim 52, wherein the pi is within 5% of 4.79 and the 

20 MW is within 5% of 471 30. 

55. The preparation according to claim 52, wherein the pi is within 1% of 4.79 and the 
MW is within 1% of 47130. 

25 56. A preparation comprising an isolated human protein, said protein comprising one or 
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more peptides having the following sequence: YCT or WEE. 


The preparation according to claim 56 wherein the protein has an isoelectric point 
(pi) of about 6.15 and an apparent molecular weight (MW) of about 191760. 


58. The preparation according to claim 57, wherein the pi is within 1 0% of 6. 1 5 and the 
MW is within 1 0% of 1 9 1 760. 


59. The preparation according to claim 57, wherein the pi is within 5% of 6. 1 5 and the 
MW is within 5% of 191760, 


60. The preparation according to claim 57, wherein the pi is within 1% of 6. 15 and the 
MW is within 1 % of 1 9 1 760. 

61. A preparation comprising an isolated human protein, said protein comprising one or 
more peptides having the following sequence: WLGD, DALGD, ADLGD, EGLGD, 
GELGD, VSLGD, SVLGD, WIGD, DAIGD, ADIGD, EGIGD, GEIGD, VSIGD, 
SVIGD, QCWDFFR, QCWDMFR, QCWDFMR, or QCWDMMR. 

62. The preparation according to claim 61 wherein the protein has an isoelectric point 
(pi) of about 6.07 and an apparent molecular weight (MW) of about 33400. 

63. The preparation according to claim 62, wherein the pi is within 10% of 6.07 and the 
MW is within 10% of 33400. 
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64. The preparation according to claim 62, wherein the pi is within 5% of 6.07 and the 
MW is within 5% of 33400. 

65. The preparation according to claim 62, wherein the pi is within 1% of 6.07 and the 

5 MW is within 1% of 33400. 

66. A preparation comprising an isolated human protein, said protein comprising one or 
more peptides having the following sequence: WLQV, DALQV, ADLQV, EGLQV, 
GELQV, VSLQV, SVLQV, WIQV, DAIQV, ADIQV, EGIQV, GEIQV, VSIQV, 

1 0 SVIQV, YFV, YMV, WLQG, DALQG, ADLQG, EGLQG, GELQG, VSLQG, 

SVLQG, WIQG, DAIQG, ADIQG, EGIQG, GEIQG, VSIQG, or SVIQG. 

67. The preparation according to claim 66 wherein the protein has an isoelectric point 
(pi) of about 5.38 and an apparent molecular weight (MW) of about 67290. 

15 

68. The preparation according to claim 67, wherein the pi is within 10% of 5.38 and the 
MW is within 10% of 67290. 

69. The preparation according to claim 67, wherein the pi is within 5% of 5.38 and the 

20 MW is within 5% of 67290. 

70. The preparation according to claim 67, wherein the pi is within 1% of 5.38 and the 
MW is within 1% of 67290. 

25 71. A preparation comprising an isolated human protein, said protein comprising a 

peptide having the following sequence: DESLQVAER or DESIQVAER. 
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The preparation according to claim 71 wherein the protein has an isoelectric point 
(pi) of about 4.98 and an apparent molecular weight (MW) of about 35440. 

The preparation according to claim 72, wherein the pi is within 10% of 4.98 and the 
MW is within 10% of 35440. 


74. The preparation according to claim 72, wherein the pi is within 5% of 4.98 and the 
MW is within 5% of 35440. 


The preparation according to claim 72, wherein the pi is within 1% of 4.98 and the 
MW is within 1% of 35440. 


76. A preparation comprising an isolated human protein, said protein comprising a 

1 5 peptide having the following sequence: VHN or VHGG. 

77. The preparation according to claim 76 wherein the protein has an isoelectric point 
(pi) of about 5.13 and an apparent molecular weight (MW) of about 20730. 

20 78. The preparation according to claim ?7, wherein the pi is within 10% of 5.13 and the 

MW is within 10% of 20730. 


The preparation according to claim 77, wherein the pi is within 5% of 5.13 and the 
MW is within 5% of 20730. 
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80. The preparation according to claim 77, wherein the pi is within 1% of 5.13 and the 
MW is within 1% of 20730. 

8 1 . A preparation comprising an isolated human protein, said protein comprising a 

5 peptide having the following sequence: PFP or PMP. 

82. The preparation according to claim 81 wherein the protein has an isoelectric point 
(pi) of about 6.37 and an apparent molecular weight (MW) of about 41260. 

10 83 . The preparation according to claim 82, wherein the pi is within 1 0% of 6.37 and the 

MW is within 10% of 41260. 

84. The preparation according to claim 82, wherein the pi is within 5% of 6.37 and the 
MW is within 5% of 41260. 

15 

85. The preparation according to claim 82, wherein the pi is within 1% of 6.37 and the 
MW is within 1% of 41260. 

86. A preparation comprising an isolated human protein, said protein comprising a 

20 peptide having the following sequence: VPN or VPGG. 

87. The preparation according to claim 86 wherein the protein has an isoelectric point 
(pi) of about 6.20 and an apparent molecular weight (MW) of about 67280. 

25 88. The preparation according to claim 87, wherein the pi is within 10% of 6.20 and the 
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MW is within 10% of 67280. 


89. The preparation according to claim 87, wherein the pi is within 5% of 6.20 and the 
MW is within 5% of 67280. 


90. The preparation according to claim 87, wherein the pi is within 1% of 6.20 and the 
MW is within 1% of 67280. 

91 . A preparation comprising an isolated human protein, said protein comprising one or 
more peptides having the following sequence: FF, FM, MF, MM, EN or EGG. 

92. The preparation according to claim 91 wherein the protein has an isoelectric point 
(pi) of about 6.72 and an apparent molecular weight (MW) of about 47550. 

93. The preparation according to claim 92, wherein the pi is within 10% of 6.72 and the 
MW is within 10% of 47550. 


94. The preparation according to claim 92, wherein the pi is within 5% of 6.72 and the 
MW is within 5% of 47550. 


95. The preparation according to claim 92, wherein the pi is within 1% of 6.72 and the 
MW is within 1% of 47550. 

96. An antibody capable of immunospecific binding to one of the following Breast 
Cancer-Associated Protein Isoforms (BPIs): BPM, BPI-5, BPI-6, BPI-9, BPI-10, 
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BPI-1 1, BPI-12, BPI-13, BPI- 14, BPI-19, BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, 
BPI-27, BPI-28, BPI-29, BPI-31, BPI-32, BPI-33, BPI-34, BPI-37, BPI-40, BPI-41, 
BPI-42, BPI-43, BPI-44, BPI-45, BPI-46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-51, 
BPI-52, BPI-53, BPI-54, BPI-55, BPI-56. 


5 

97. The antibody of claim 96, which is a monoclonal antibody. 

98. The antibody of claim 96 or 97, which binds to the BPI with greater affinity than to 
another isoform of the BPI. 

0 

99. The antibody of claim 96, which binds to the BPI with greater affinity than to any 
other iso form of the BPI. 

100. A kit comprising one or more antibodies as claimed in claim 96 or claim 97. 
i5 

101. A pharmaceutical composition comprising a therapeutically effective amount of an 
antibody as claimed in claim 96 or claim 97 and a pharmaceutically acceptable 
carrier. 


20 102. A pharmaceutical composition comprising: 

a therapeutically effective amount of a fragment or derivative of an antibody as 
claimed in claim 96 or claim 97, said fragment or derivative containing the binding 
domain of the antibody; and 

a pharmaceutically acceptable carrier. 

25 103. A method of treating or preventing breast cancer comprising administering to a 
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subject in need of such treatment or prevention a therapeutically effective amount of 
a nucleic acid encoding one of the following Breast Cancer-Associated Protein 
Isoforms (BPIs): BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, BPI-1 1, BPI-12, BPI-13, BPI- 
14, BPI-1 9, BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, BPI-27, BPI-28, BPI-29, BPI- 
31, BPI-32, BPI-33, BPI-34, BPI-37, BPI-40, BPI-41, BPI-42, BPI-43, BPI-44, BPI- 
45, BPI-46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-5 1, BPI-52, BPI-53, BPI-54, BPI- 
55, BPI-56. 

104. A method of treating or preventing breast cancer comprising administering to a 

subject in need of such treatment or prevention a therapeutically effective amount of 
a nucleic acid that inhibits the function of one or more of the following Breast 
Cancer-Associated Protein Isoforms (BPIs): BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, 
BPI-1 1, BPI-12, BPI-13, BPI-14, BPI-19, BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, 
BPI-27, BPI-28, BPI-29, BPI-31, BPI-32, BPI-33, BPI-34, BPI-37, BPI-40, BPI-41, 
BPI-42, BPI-43, BPI-44, BPI-45, BPI-46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-5 1, 
BPI-52, BPI-53, BPI-54, BPI-55, BPI-56. 

105. The use of a therapeutically effective amount of a nucleic acid encoding one of the 
following Breast Cancer-Associated Protein Isoforms (BPIs): BPI-1, BPI-5, BPI-6, 
BPI-9, BPI-10, BPI-1 1, BPI-12, BPI-13, BPI-14, BPI-19, BPI-20, BPI-21, BPI-23, 
BPI-24, BPI-25, BPI-27, BPI-28, BPI-29, BPI-31, BPI-32, BPI-33, BPI-34, BPI-37, 
BPI-40, BPI-41, BPI-42, BPI-43, BPI-44, BPI-45, BPI-46, BPI-47, BPI-48, BPI-49, 
BPI-50, BPI-5 1, BPI-52, BPI-53, BPI-54, BPI-55, BPI-56 in the manufacture of a 
medicament for use in the prevention or treatment of breast cancer. 

106. The use of a therapeutically effective amount of a nucleic acid that inhibits the 
function of one or more of the following Breast Cancer-Associated Protein Isoforms 
(BPIs): BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, BPI-1 1, BPI-12, BPI-13, BPI-14, BPI- 
19, BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, BPI-27, BPI-28, BPI-29, BPI-31, BPI- 
32, BPI-33, BPI-34, BPI-37, BPI-40, BPI-41, BPI-42, BPI-43, BPI-44, BPI-45, BPI- 
46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-5 1, BPI-52, BPI-53, BPI-54, BPI-55, BPI- 
56 in the manufacture of a medicament for use in the prevention or treatment of 
breast cancer. 
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107. The method of claim 104 or claiml06 , wherein the nucleic acid is a BPI antisense 
nucleic acid or ribozyme. 

108. A method of screening for agents that interact with a BPI, a BPI fragment, or a BPI- 
related polypeptide, said method comprising: 

(a) contacting a BPI, a biologically active portion of a BPI, or a BPI-related 
polypeptide with a candidate agent; and 

(b) determining whether or not the candidate agent interacts with the BPI, the 
BPI fragment, or the BPI-related polypeptide. 

109. The method of claim 106, wherein the BPI, the BPI fragment, or the BPI-related 
polypeptide is expressed by cells. 

1 1 0. The method of claim 1 07, wherein the cells express a recombinant BPI, a 
recombinant BPI fragment, or a recombinant BPI-related polypeptide. 

111. A method of screening for agents that modulate the expression or activity of a BPI or 
a BPI-related polypeptide comprising: 

(a) contacting a first population of cells expressing a BPI or a BPI-related 
polypeptide with a candidate agent; 

(b) contacting a second population of cells expressing said BPI or said BPI-related 
polypeptide with a control agent; and 

(c) comparing the level of said BPI or said BPI-related polypeptide or mRNA 
encoding said BPI or said BPI-related polypeptide in the first and second 
populations of cells, or comparing the level of induction of a cellular second 
messenger in the first and second populations of cells. 

112. The method of claim 109, wherein the level of said BPI or said BPI-related 
polypeptide, mRNA encoding said BPI or said BPI-related polypeptide, or said 
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cellular second messenger is greater in the first population of cells than in the second 
population of cells. 

113. The method of claim 1 09, wherein the level of said BPI or said BPI-related 
polypeptide, mRNA encoding said BPI or said BPI-related polypeptide, or said 
cellular second messenger is less in the first population of cells than in the second 
population of cells. 

1 14. A method of screening for or identifying agents that modulate the expression or 
activity of a BPI or a BPI-related polypeptide comprising: 

(a) administering a candidate agent to a first mammal or group of mammals; 

(b) administering a control agent to a second mammal or group of mammals; and 

(c) comparing the level of expression of the BPI or the BPI-related polypeptide or of 
mRNA encoding the BPI or the BPI-related polypeptide in the first and second 
groups, or comparing the level of induction of a cellular second messenger in the 
first and second groups. 


115. The method of claim 112, wherein the mammals are animal models for breast cancer. 

116. The method of claim 1 12 or 113, wherein the level of expression of said BPI or said 
BPI-related polypeptide, mRNA encoding said BPI or said BPI-related polypeptide, 
or of said cellular second messenger is greater in the first group than in the second 
group. 

117. The method of claim 1 1 2 or 1 1 3, wherein the level of expression of said BPI or said 
BPI-related polypeptide, mRNA encoding said BPI or said BPI-related polypeptide, 
or of said cellular second messenger is less than in the first group than in the second 
group. 

1 1 8. The method of claim 1 12, wherein the levels of said BPI or said BPI-related 
polypeptide, mRNA encoding said BPI or said BPI-related polypeptide, or of said 
cellular second messenger in the first and second groups are further compared to the 
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level of said BPI or said BPI-related polypeptide or said mRNA encoding said BPI or 
said BPI-related polypeptide in normal control mammals. 

1 1 9. The method of claim 1 12, wherein administration of said candidate agent modulates 
the level of said BPI or said BPI-related polypeptide, or said mRNA encoding said 
BPI or said BPI-related polypeptide, or said cellular second messenger in the first 
group towards the levels of said BPI or said BPI-related polypeptide or said mRNA 
or said cellular second messenger in the second group. 

120. The method of any one of claims 1 12 to 1 17, wherein said mammals are human 
subjects having breast cancer. 

121. A method of screening for or identifying agents that interact with a BPI or a BPI- 
related polypeptide, comprising 

(a) contacting a candidate agent with the BPI or the BPI-related polypeptide, and 

(b) quantitatively detecting binding, if any, between the agent and the BPI or the 
BPI-related polypeptide. 

120. A method of screening for or identifying agents that modulate the activity of a BPI or 
a BPI-related polypeptide, comprising 

(a) in a first aliquot, contacting a candidate agent with the BPI or the BPI-related 
polypeptide, and 

(b) comparing the activity of the BPI or the BPI-related polypeptide in the first 
aliquot after addition of the candidate agent with the activity of the BPI or the 
BPI-related polypeptide in a control aliquot, or with a previously determined 
reference range. 

121 . The method according to claim 1 19 or 120, wherein the BPI or the BPI-related 
polypeptide is recombinant protein. 
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122. The method according to any one of claims 1 1 9 to 1 2 1 , wherein the BPI or the BPI- 
related polypeptide is immobilized on a solid phase. 

123. An isolated nucleic acid molecule that hybridizes to a nucleotide sequence encoding 
BPI-4 1 , BPI-42, BPI-43, BPI-44, BPI-45, BPI-46, BPI-47, BPI-48, BPI-49, BPI-50, 

5 BPI-51 , BPI-52, BPI-53, BPI-54, BPI-55, BPI-56 or their complements. 

124. An isolated nucleic acid molecule that hybridizes to a nucleotide sequence encoding 
at least 10 consecutive amino acids of BPI-41, BPI-42, BPI-43, BPI-44, BPI-45, BPI- 
46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-5 1 , BPI-52, BPI-53, BPI-54, BPI-55, BPI-56 
or their complements. 

10 125. A vector comprising the nucleic acid molecule of claim 123 or 124. 

1 26. A host cell comprising the vector of claim 125. 

127. A host cell genetically engineered to express the nucleic acid molecule of claim 123 
or 124. 

1 28. A method for screening, diagnosis or prognosis of breast cancer in a subject or for 

15 monitoring the effect of an anti-breast cancer drug or therapy administered to a subject, 

comprising: 

(a) contacting at least one oligonucleotide probe comprising 10 or more consecutive 
nucleotides complementary to a nucleotide sequence encoding a BPI chosen from 
BPI-1, BPI-5, BPI-6, BPI-9, BPI-10, BPI-11, BPI-12, BPI-13, BPI-14, BPI-19, 

20 BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, BPI-27, BPI-28, BPI-29, BPI-31, BPI- 

32, BPI-33, BPI-34, BPI-37, BPI-40, BPI-41, BPI-42, BPI-43, BPI-44, BPI-45, 
BPI-46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-51, BPI-52, BPI-53, BPI-54, BPI- 
55, BPI-56 with an RNA obtained from a biological sample from the subject or 
with cDNA copied from the RNA wherein said contacting occurs under conditions 
25 that permit hybridization of the probe to the nucleotide sequence if present; 

(b) detecting hybridization, if any, between the probe and the nucleotide sequence; and 

(c) comparing the hybridization, if any, detected in step (b) with the hybridization 
detected in a control sample, or with a previously determined reference range.' 
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129. The method of claim 128, wherein step (a) comprises contacting a plurality of 
oligonucleotide probes comprising 10 or more consecutive nucleotides complementary 
to a nucleotide sequence encoding a BPI chosen from BPI-1, BPI-5, BPI-6, BPI-9, BPI- 

5 10, BPI-1 1, BPI- 12, BPI- 13, BPI-14, BPI-19, BPI-20, BPI-21, BPI-23, BPI-24, BPI-25, 

BPI-27, BPI-28, BPI-29, BPI-31, BPI-32, BPI-33, BPI-34, BPI-37, BPI-40, BPI-41, 
BPI-42, BPI-43, BPI-44, BPI-45, BPI-46, BPI-47, BPI-48, BPI-49, BPI-50, BPI-5 1, 
BPI-52, BPI-53, BPI-54, BPI-55, BPI-56 with an RNA obtained from a biological 
sample from the subject or with cDNA copied from the RNA wherein said contacting 
10 occurs under conditions that permit hybridization of the probe to the nucleotide 

sequence if present. 

130. The method of claim 128, wherein step (a) includes the step of hybridizing the 
nucleotide sequence to a DNA array, wherein one or more members of the array are the 
probes complementary to a plurality of nucleotide sequences encoding distinct BPIs. 

15 131. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

or moderately stringent conditions to the following nucleic acid sequence: GARTGYCAR. 

132. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GAGTGCCAG. 

133. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

20 or moderately stringent conditions to the following nucleic acid sequence: 

TGYCARGCKAC^GGNTTYWSNCCNMGN or TGYCARGCNACNGGNATGWSNCCNMGN. 

1 34. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 

25 TGCCAGGCCACCGGCTTCAGCCCCCGC or TGCCAGGCCACCGGCATGAGCCCCCGC. 

135. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
GAYGAYTTY or GAYGAYATG. 


135 



WO 01/13117 


PCT/G B00/03 143 


136. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
YTNGARTTYTTY CCNMGN, ATHGARTTYTTYCCNMGN, YTNGARATGTTYCCNMGN, 
YTNGARTTY ATGCCNMGN, YTNGARAGTAGTCCNMGN, ATHGARATGTTYCCNMGN, 

ATHG ARTTY ATGCCNMGN, or ATHGARATGATGCCNMGN. 

137. The isolated nucleic acid molecule according to claim 135, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: YTNGARTTYTTYCCNMGN, 
ATHGARTTYTTYCCNMGN, YTNGARATGTTYCCNMGN, YTNGARTTYATGCCNMGN, 
YTNGARAGTAGTCCNMGN, ATHGARATGTTYCCNMGN, ATHGARTTYATGCCNMGN, or 
ATHGARATGATGCCNMGN. 

138. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GACGACTTC 
or GACGACATG. 

139. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
CTGGAGTTCTTCCCCCGC, ATCGAGTTCTTCCCCCGC, CTGGAGATGTTCCCCCGC, 
CTGGAGTTCATGCCCCGC, CTGGAGAGTAGTCCCCGC, ATCGAGATGTTCCCCCGC, 
ATCGAGTTCATGCCCCGC, or ATCGAGATGATGCCCCGC. 

140. The isolated nucleic acid molecule according to claim 138, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: CTGGAGTTCTTCCCCCGC, ATCGAGTTCTTCCCCCGC, 
CTGGAGATGTTCCCCCGC, CTGGAGTTCATGCCCCGC, CTGGAGAGTAGTCCCCGC, 
ATCGAGATGTTCCCCCGC, ATCGAGTTCATGCCCCGC, or ATCGAGATGATGCCCCGC. 

141 . An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GCNAAY or 
GCNGGNGGN. 

142. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GCCAAC or 
GCCGGCGGC. 

143. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
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or moderately stringent conditions to the following nucleic acid sequence: GTNTAYCAR 

144. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
YTNYTNGARAAY, YTNATHGARAAY, ATHYTNGARAAY, ATHATHGARAAY, 

5 YTNYTNGARGGNGGN, YTNATHGARGGNGGN, ATHYTNGARGGNGGN, 

ATHATHGARGGNGGN. 

145. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: CCNGCN. 

146. The isolated nucleic acid molecule according to claim 143, wherein the nucleic acid 

10 also hybridizes under highly stringent conditions or moderately stringent conditions to 

the following nucleic acid sequence: YTNYTNGARAAY, YTNATHGARAAY, 
ATHYTNGARAAY, ATHATHGARAAY, YTNYTNGARGGNGGN, YTNATHGARGGNGGN, 
ATHYTNGARGGNGGN, ATHATHGARGGNGGN. 

147. The isolated nucleic acid molecule according to claim 143, wherein the nucleic acid 

1 5 also hybridizes under highly stringent conditions or moderately stringent conditions to 

the following nucleic acid sequence: CCNGCN. 

148. The isolated nucleic acid molecule according to claim 143, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: YTNYTNGARAAY, YTNATHGARAAY, 

20 ATHYTNGARAAY, ATHATHGARAAY, YTNYTNGARGGNGGN, YTNATHGARGGNGGN, 

ATHYTNGARGGNGGN, ATHATHGARGGNGGN, and also hybridizes under highly stringent 
conditions or moderately stringent conditions to the following nucleic acid sequence: 
CCNGCN. 

149. The isolated nucleic acid molecule according to claim 144, wherein the nucleic acid 

25 also hybridizes under highly stringent conditions or moderately stringent conditions to 

the following nucleic acid sequence: CCNGCN. 

1 50. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: gtgtaccag. 

151. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
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or moderately stringent conditions to the following nucleic acid sequence: 

, CTGCTGGAGAAC, CTGATCGAGAAC, ATCCTGGAGAAC, ATCATCGAGAAC, 
CTGCTGGAGGGCGGC, CTGATCGAGGGCGGC, ATCCTGGAGGGCGGC, or 
ATCATCGAGGGCGGC. 

1 52. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: CCCGCC. 

153. The isolated nucleic acid molecule according to claim 150, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: CTGCTGGAGAAC, CTGATCGAGAAC, 
ATCCTGGAGAAC, ATCATCGAGAAC, CTGCTGGAGGGCGGC, CTGATCGAGGGCGGC, 
ATCCTGGAGGGCGGC, or ATCATCGAGGGCGGC. 

1 54. The isolated nucleic acid molecule according to claim 1 50, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: CCCGCC. 

155. The isolated nucleic acid molecule according to claim 150, wherein the nucleic-acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: CTGCTGGAGAAC, CTGATCGAGAAC, 
ATCCTGGAGAAC, ATCATCGAGAAC, CTGCTGGAGGGCGGC, CTGATCGAGGGCGGC, 
ATCCTGGAGGGCGGC, or ATCATCGAGGGCGGC and also hybridizes under highly 
stringent conditions or moderately stringent conditions to the following nucleic acid 
sequence: CCCGCC. 

1 56. The isolated nucleic acid molecule according to claim 151, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: CCCGCC. 

157. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
TGYTAYTGYCARAAR. 

1 58. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
TGCTACTGCCAGAAG. 
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1 59. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
YTNGAYGAYTAYYTNAAY, YTNGAYGAYTAYATHAAY, ATHGAY GAYTA YYTNAAY, 
ATHGAYGAYTAYATHAAY, YTNGAYGAYTAYYTNGGNGGN, 

5 . YTNGAYGAYTAYATHGGNGGN, ATHGAYGAYTAYYTNGGNGGN, or 

ATCGAYGAYTAYATHGGNGGN. 

160. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: CAYGCNCAR. 

161. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

10 or moderately stringent conditions to the following nucleic acid sequence: GARYTN or 

GARATH. 

162. The isolated nucleic acid molecule according to claim 159, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: CAYGCNCAR. 

15 163. The isolated nucleic acid molecule according to claim 159, wherein the nucleic acid 

also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: GARYTN or GARATH. 

164. The isolated nucleic acid molecule according to claim 1 59, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 

20 the following nucleic acid sequence: CAYGCNCAR and also hybridizes under highly 

stringent conditions or moderately stringent conditions to the following nucleic acid 
sequence: GARYTN or GARATH. 

165. The isolated nucleic acid molecule according to claim 160, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 

25 the following nucleic acid sequence: GARYTN or GARATH. 

166. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
CTGGACGACTACCTGAAC, CTGGACGACTACATCAAC, ATCGACGACTACCTGAAC, 

ATCG ACG A CTACATCAAC, CTGGACGACTACCTGGGCGGC, CTGGACGACTACATCGGCGGC, 
30 ATCGACGACTACCTGGGCGGC, or ATCGACGACTACATCGGCGGC. 
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167. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: CACGCCCAG. 

168. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GAGCTG or 
GAGATC. 

1 69. The isolated nucleic acid molecule according to claim 167, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: GAGCTG or GAGATC 

170. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 

TTY GGNCCNGTNCCNMGN or ATGGGNCCNGTNCCNMGN. 

171. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
TTCGGCCCCGTGCCCCGC or ATGGGCCCCGTGCCCCGC. 

1 72. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: TAYTGYACN. 

1 73. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
GTNGTNGARGAR. 

174. The isolated nucleic acid molecule according to claim 173, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: GTNGTNGARGAR. 

175. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: TACTGCACC. 

1 76. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
GTGGTGGAGGAG. 

177. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
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or moderately stringent conditions to the following nucleic acid sequence: 
TGGYTNGGNGAY, GAYGCNYTNGGNGAY, GCNGAYYTNGGNGAY, GARGGNYTNGGNGAY, 
GGNGARYTNGGNGAY, GTNWSNYTNGGNGAY, WSNGTNYTNGGNGAY, TGGATHGGNGA Y , 
GAYGCNATHGGNGAY, GCNGAYaTHGGNGAY, GARGGNATHGGNGAY, 

5 GGNGARATHGGNGAY, GTNWSNATHGGNGAY, or WSNGTNATHGGNGAY. 

178. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
CARTGYGTNGTNGAYTTYTTYMGN, CARTGYGTNGTNGAYATGTTYMGN, 
CARTGYGTNGTNGAYTTYATGMGN, or CARTGYGTNGTNGAYATGATGMGN. 

10 179. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

or moderately stringent conditions to the following nucleic acid sequence: 
TGGCTGGGCGAC, GACGCCCTGGGCGAC, GCCGACCTGGGCGAC, GAGGGCCTGGGCGAC, 
GGCGAGCTGGGCGAC, GTGAGCCTGGGCGAC, AGCGTGCTGGGCGAC, TGGATCGGCGAC, 
GACGCCATCGGCGAC, GCCGACATCGGCGAC, GAGGGCATCGGCGAC, 

1 5 GGCGAGATCGGCGAC, GTGAGCATCGGCGAC, orAGCGTGATCGGCGAC. 

1 80. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
CAGTGCGTGGTGGACTTCTTCCGC, CAGTGCGTGGTGGACATGTTCCGC, 
CAGTGCGTGGTGGACTTCATGCGC, or CAGTGCGTGGTGGACATGATGCGC. 

20 181. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

or moderately stringent conditions to the following nucleic acid sequence: 
TGGYTNCARGTN, GAYGCNYTNCARGTN, GCNGAYYTNCARGTN, GARGGNYTNCARGTN, 
GGNGARYTNCARGTN, GTNWSNYTNCARGTN, W SNGTNYTN C ARGTN, TGGATHCARGTN, 
GAYGCNATHCARGTN, GCNG A Y ATHC ARGTN, GARGGNATHCARGTN, 

25 GGNGARATHCARGTN, GTNWSNATHCARGTN, or W SNGTN ATHC ARGTN. 

1 82. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: TAYTTYGTN 
or TAYATGGTN. 

183. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

30 or moderately stringent conditions to the following nucleic acid sequence: 

TGGYTNCARGGN, GAYGCKYTNCARGGN, GCNGAYYTNCARGGN, GARGGNYTNCARGGN, 
GGNGARYTNCARGGN, GTNWSNYTNCARGGN, WSNGTNYTNCARGGN, TGGATHCARGGN, 
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GAYGCNATHCARGGN, GCNGA Y ATHCARGGN, GARGGNATHCARGGN, 
GGNGARATHCARGGN, GTNWSNATHCARGGN, or WSNGTNATHCARGGN. 

1 84. The isolated nucleic acid molecule according to claim 1 83, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: TAYTTYGTN or TAYATGGTO 

1'85. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
TGGCTGCAGGTG, GACGCCCTGCAGGTG, GCCGACCTGCAGGTG, GAGGGCCTGCAGGTG, 
GGCGAGCTGCAGGTG, GTGAGCCTGCAGGTG, AGCGTGCTGCAGGTG, TGGATCCAGGTG, 
GACGCCATCCAGGTG, GCCGACATCCAGGTG, GAGGGCATCCAGGTG, 
GGCGAGATCCAGGTG, GTGAGCATCCAGGTG, or AGCGTGATCCAGGTG. 

1 86. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

or moderately stringent conditions to the following nucleic acid sequence: TACTTCGTG 
or TACATGGTG. 

187. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
TGGGTGCAGGGC, GACGCCCTGCAGGGC, GCCGACCTGCAGGGC, GAGGGCCTGCAGGGC, 
GGCGAGCTGCAGGGC, GTGAGCCTGCAGGGC, AGCGTGCTGCAGGGC, TGGATCCAGGGC, 
GACGCCATCCAGGGC, GCCGACATCCAGGGC, GAGGGCATCCAGGGC, 
GGCGAGATCCAGGGC, GTG AGC ATCCA GGGC, or AGCGTGATCCAGGGC. 

188. The isolated nucleic acid molecule according to claim 1 87, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: TACTTCGTG or TACATGGTG. 

1 89. The isolated nucleic acid molecule according to claim 223, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: TACTTCGTG or TACATGGTG and also hybridizes 
under highly stringent conditions or moderately stringent conditions to the following 
nucleic acid sequence: TGGCTGCAGGGC, GACGCCCTGCAGGGC, GCCGACCTGCAGGGC, 
GAGGGCCTGCAGGGC, GGCGAGCTGCAGGGC, GTGAGCCTGCAGGGC, 
AGCGTGCTGCAGGGC, TGGATCCAGGGC, GACGCCATCCAGGGC, GCCGACATCCAGGGC, 
GAGGGCATCCAGGGC, GGCGAGATCCAGGGC, GTGAGCATCCAGGGC, or 
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AGCGTGATCCA GGGC. 

190. The isolated nucleic acid molecule according to claim 224, wherein the nucleic acid 
also hybridizes under highly stringent conditions or moderately stringent conditions to 
the following nucleic acid sequence: TGGCTGCAGGGC, GACGCCCTGCAGGGC, 

5 GCCGACCTGCAGGGC, GAGGGCCTGCAGGGC, GGCGAGCTGCAGGGC, 

GTGAGCCTGCAGGGC, AGCGTGCTGCAGGGC, TGGATCCAGGGC, GACGCCATCCAGGGC, 
GCCGACATCCAGGGC, GAGGGCATCCAGGGC, GGCGAGATCCAGGGC, 
GTGAGCATCCAGGGC, or AGCGTGATCCAGGGC. 

191. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

10 or moderately stringent conditions to the following nucleic acid sequence: 

GAYGARWSNYTNCARGTNGCNGARMGN or GAYGARWSNATHCARGTNGCNGARMGN. 

1 92. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: 
GACGAGAGCCTGCAGGTGGCCGAGCGC or GACGAGAGCATCCAGGTGGCCGAGCGC. 

15 193. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

or moderately stringent conditions to the following nucleic acid sequence: GTNCAYAAY 
or GTNCAYGGNGGN. 

1 94. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GTGCACAAC 

20 orGTGCA CGGCGGC. 

195. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: CCNTTYCCN 
or CCNATGCCN 

196. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

25 or moderately stringent conditions to the following nucleic acid sequence: CCCTTCCCC 

or CCCATGCCC. 

197. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GTNCCNAAY 
or GTNCCNGGNGGN. 

30 198. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
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or moderately stringent conditions to the following nucleic acid sequence: GTGCCCAAC 
or GTGCCCGGCGGC. 

1 99. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: TTYTTY, 
TTYATG, ATGTTY, or ATGATG. 

200. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: GARAAY or 
GARGGNGGN. 

201. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 
or moderately stringent conditions to the following nucleic acid sequence: TTCTT C, 
TTCATG, ATGTTC, or ATGATG. 

202. An isolated nucleic acid molecule that hybridizes under highly stringent conditions 

or moderately stringent conditions to the following nucleic acid sequence: GAGAAC or 
GAGGGCGGC. 
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